Question: Identification of SNPs located on genes
gravatar for valerie
2.9 years ago by
valerie70 wrote:

Hi guys,

I have a VCF file with SNPs located on the whole genome and I want to identify those that are located on genes. I have converted VCF file to Genotype matrix and wrote a python script that goes through both GM file and GTF one to find SNPs located in genes. But this takes a huge amount of time. Does somebody know if there exists a standard tool that solves my problem in an optimal time-efficient way?


snp vcf • 803 views
ADD COMMENTlink modified 2.9 years ago by WouterDeCoster42k • written 2.9 years ago by valerie70
gravatar for WouterDeCoster
2.9 years ago by
WouterDeCoster42k wrote:

If you have a VCF file then you can easily use SnpEff, Annovar or VEP for annotation.

ADD COMMENTlink written 2.9 years ago by WouterDeCoster42k

Thank you for your advice! Actually my goal is to study genetic differences between two groups of mice (3 mice in each group). My VCF file contains information about all the samples. After converting to genotype matrix I selected only those SNPs that differ two groups. And now I want to understand what genes are involved in this differences. Is it possible to use the tools you mentioned to solve this problem?

ADD REPLYlink written 2.9 years ago by valerie70

Most convenient would probably be to filter your vcf now with what you found when selecting variants differing between the two groups.

ADD REPLYlink written 2.9 years ago by WouterDeCoster42k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1314 users visited in the last hour