What is the best method to map snps called using GATK to corresponding genes?
1
1
Entering edit mode
7.3 years ago

Hi,

I have a list of snp's in a vcf file called using GATK for a plant genome. Now, the objective is to map those SNP's to corresponding genes in the genome (I have the reference genome and corresponding GFF file with me) since I have the SNP positions in the POS column of vcf along with the chromosome name and not gene name.

What is the best approach to achieve the task? What I could think of is:

vcf to bed conversion (by some tool, vcftools ?, please suggest if you can) and then getting the corresponding gene names from the GFF file (bedtools; may be?).

Any other alternative approach?

PS: I googled and found many tools specifically designed to annotate human SNP data, while could not find any for plant.

SNP GATK gene • 2.7k views
ADD COMMENT
0
Entering edit mode
7.3 years ago
Medhat 9.7k

To convert from vcf to bed you can use vcf2bed from here


To annotate I use SnpEff as long as you have the genome and gtf/gff it will work; but first the tool have command to search if there database contains annotation for your genome of interest

java -jar snpEff.jar databases | less 
or 
java -jar snpEff.jar databases | grep -i "your genome"

after that use the result to annotate your vcf

java -Xmx5g -jar snpEff.jar result_from_previous_step your.vcf > your.ann.vcf

other wise you can use another tools supported by the software to create your annotation database

ADD COMMENT
0
Entering edit mode

Thanks, I downloaded and checked the database for the presence of my genome of interest (i am looking for sesame indicum) without luck! From a first look at the file, it seems it only supports bacterial genomes. Any further suggestions?

ADD REPLY
0
Entering edit mode

as I wrote if the database does not exist you can create it easily read Building database section 9

ADD REPLY

Login before adding your answer.

Traffic: 2052 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6