Question: Find SNPs in the exome from WGS data
gravatar for jh
2.2 years ago by
jh10 wrote:


I have called variants in WGS data with GATK. Now I would like to filter out alla heterozygous SNPs in the exome and which genes they are present in.

The output should be structured:

chrom | position | GT (nucleotides) | gene

1 | 100 | A/T | geneA

1 | 200 | G/T | geneA

2 | 100 | A/C | geneB

Thanks for any help!


sequencing snp gene • 763 views
ADD COMMENTlink modified 2.2 years ago • written 2.2 years ago by jh10

Thanks a lot for the advice, VEP works great!

I can't figure out how to print the reference allele as a separate column.

Does anyone know?


ADD REPLYlink written 2.2 years ago by jh10
gravatar for andrew.j.skelton73
2.2 years ago by
andrew.j.skelton735.2k wrote:

You could use Ensembl's VEP to annotate all your filtered variants, then filter based off that. Alternatively, you could try Gemini, which would allow you to make complex and flexible queries around large sets of data (based on SQLite)

ADD COMMENTlink written 2.2 years ago by andrew.j.skelton735.2k

I would highly recommend GEMINI (which you can use on VEP or SnpEff annotated VCF files). It will let you do family-wise queries and lots of different ways of slicing the data. But even for the simple case you describe I think you'll likely find all of the additional capabilities appealing.

ADD REPLYlink written 2.2 years ago by Dan Gaston7.0k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1085 users visited in the last hour