Question: Proxy SNPs from 1000 genome phase3
gravatar for bayoteju
12 weeks ago by
bayoteju10 wrote:

I have a list of about 250 SNPs. I want to obtain the proxy SNPs for them using the 1000 genomes. I have downloaded corresponding .vcf and .ped files from 1000 genomes. I have also used PLINK to obtain the .bim, .fam and .bed files.

How do I proceed to determine the proxy SNPs without using any software?

Thank you.

ld plink snp • 420 views
ADD COMMENTlink modified 11 weeks ago • written 12 weeks ago by bayoteju10

Thank you very much. I am using snpStat to calculate the LDs.

ADD REPLYlink written 11 weeks ago by bayoteju10
gravatar for zx8754
12 weeks ago by
zx87547.5k wrote:

Get LD for selected SNPs using plink --r2. Then pick any proxy SNP having highest LD score.

Why not using other software? We could, for example, use PriorityPruner which is specifically designed to pick proxies with prioritisation, i.e. when SNP1 is in LD with SNP2 and SNP3, it will prioritise based on other features of the SNP (like pvalue, frequency, annotations, etc.).

PriorityPruner is a software program which can prune a list of SNPs that are in high linkage disequilibrium (LD) with other SNPs in the list, while preferentially keeping SNPs of higher priority (e.g., the most significant SNPs in a genome-wide association study). The process of pruning SNPs based on LD is sometimes referred to as "LD clumping".

ADD COMMENTlink written 12 weeks ago by zx87547.5k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1467 users visited in the last hour