Question: Filtering ASE variants with those in high LD using plink
0
gravatar for drautuna
4.5 years ago by
drautuna60
United States
drautuna60 wrote:

I have whole genome data for variants from several individuals in (1).TFAM and (2).TPED format, as used by plink, as well as a (3)list of variants that are allele specifically expressed (ASE). All of the ASE variants are included in the plink .TPED. I need to find out which variants in the plink .TPED are in LD with each of the ASE variants - some ASE variants may have zero to several hundred variants in LD with it.

Eventually I want to have the original list of ASE variants and, for each, the variants that are in strong LD (defined has having an r-squared greater than a certain threshold) with the ASE variants.

So far, I have been looking at plink's LD calculation documentation:

http://pngu.mgh.harvard.edu/~purcell/plink/ld.shtml

but I'm not 100% sure how to go about this using plink. Any help - some advice to a step-by-step tutorial would be appreciated.

Thanks

edit: 

I have another question now; the output of the latter gives me the file plink.tags, which consists of all the names of the original ASE SNPs as well as the SNPs in LD with those original SNPs. 

In the .TPED, it has the data for all the SNPs - I want to extract, from this file, data from only the SNPs in plink.tags. How might I go about doing this?

 

ADD COMMENTlink modified 4.5 years ago • written 4.5 years ago by drautuna60
0
gravatar for chrchang523
4.5 years ago by
chrchang5234.8k
United States
chrchang5234.8k wrote:
plink --tfile [...] --show-tags [list of ASE variants] --list-all --tag-r2 [threshold]

should do what you want.  (You may also want to raise the --tag-kb setting, depending on how stringent your r^2 threshold is.)

ADD COMMENTlink written 4.5 years ago by chrchang5234.8k

Thanks, it worked!

I have another question now; the output of the latter gives me the file plink.tags, which consists of all the names of the original ASE SNPs as well as the SNPs in LD with those original SNPs. 

In the .TPED, it has the data for all the SNPs - I want to extract, from this file, data from only the SNPs in plink.tags. How might I go about doing this?

ADD REPLYlink modified 4.5 years ago • written 4.5 years ago by drautuna60
plink --tfile [...] --extract plink.tags --recode --transpose --out [new .tped/.tfam prefix]

should work.

ADD REPLYlink written 4.5 years ago by chrchang5234.8k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1243 users visited in the last hour