Question: Proxy SNPs from 1000 genome phase3
gravatar for bayoteju
20 months ago by
bayoteju10 wrote:

I have a list of about 250 SNPs. I want to obtain the proxy SNPs for them using the 1000 genomes. I have downloaded corresponding .vcf and .ped files from 1000 genomes. I have also used PLINK to obtain the .bim, .fam and .bed files.

How do I proceed to determine the proxy SNPs without using any software?

Thank you.

ld plink snp • 757 views
ADD COMMENTlink modified 20 months ago • written 20 months ago by bayoteju10

Thank you very much. I am using snpStat to calculate the LDs.

ADD REPLYlink written 20 months ago by bayoteju10
gravatar for zx8754
20 months ago by
zx87549.7k wrote:

Get LD for selected SNPs using plink --r2. Then pick any proxy SNP having highest LD score.

Why not using other software? We could, for example, use PriorityPruner which is specifically designed to pick proxies with prioritisation, i.e. when SNP1 is in LD with SNP2 and SNP3, it will prioritise based on other features of the SNP (like pvalue, frequency, annotations, etc.).

PriorityPruner is a software program which can prune a list of SNPs that are in high linkage disequilibrium (LD) with other SNPs in the list, while preferentially keeping SNPs of higher priority (e.g., the most significant SNPs in a genome-wide association study). The process of pruning SNPs based on LD is sometimes referred to as "LD clumping".

ADD COMMENTlink written 20 months ago by zx87549.7k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1105 users visited in the last hour