Question

Filtering ASE variants with those in high LD using plink

0

Entering edit mode

9.6 years ago

drautuna ▴ 60

I have whole genome data for variants from several individuals in (1).TFAM and (2).TPED format, as used by plink, as well as a (3)list of variants that are allele specifically expressed (ASE). All of the ASE variants are included in the plink .TPED. I need to find out which variants in the plink .TPED are in LD with each of the ASE variants - some ASE variants may have zero to several hundred variants in LD with it.

Eventually I want to have the original list of ASE variants and, for each, the variants that are in strong LD (defined has having an r-squared greater than a certain threshold) with the ASE variants.

So far, I have been looking at plink's LD calculation documentation, but I'm not 100% sure how to go about this using plink. Any help - some advice to a step-by-step tutorial would be appreciated.

Thanks

edit:

I have another question now; the output of the latter gives me the file plink.tags, which consists of all the names of the original ASE SNPs as well as the SNPs in LD with those original SNPs.

In the .TPED, it has the data for all the SNPs - I want to extract, from this file, data from only the SNPs in plink.tags. How might I go about doing this?

plink LD linkage-disequilibrium • 2.7k views

ADD COMMENT • link updated 2.3 years ago by Ram 43k • written 9.6 years ago by drautuna ▴ 60

Ram · Answer 1 · 2014-09-22

0

Entering edit mode

9.6 years ago

chrchang523 10k

plink --tfile [...] --show-tags [list of ASE variants] --list-all --tag-r2 [threshold]

should do what you want. (You may also want to raise the --tag-kb setting, depending on how stringent your r^2 threshold is.)

ADD COMMENT • link updated 2.3 years ago by Ram 43k • written 9.6 years ago by chrchang523 10k

0

Entering edit mode

Thanks, it worked!

I have another question now; the output of the latter gives me the file plink.tags, which consists of all the names of the original ASE SNPs as well as the SNPs in LD with those original SNPs.

In the .TPED, it has the data for all the SNPs - I want to extract, from this file, data from only the SNPs in plink.tags. How might I go about doing this?

ADD REPLY • link updated 2.3 years ago by Ram 43k • written 9.6 years ago by drautuna ▴ 60

0

Entering edit mode

plink --tfile [...] --extract plink.tags --recode --transpose --out [new .tped/.tfam prefix]

should work.

ADD REPLY • link 9.6 years ago by chrchang523 10k