Question: Identification of SNPs located on genes
0
gravatar for valerie
2.2 years ago by
valerie60
valerie60 wrote:

Hi guys,

I have a VCF file with SNPs located on the whole genome and I want to identify those that are located on genes. I have converted VCF file to Genotype matrix and wrote a python script that goes through both GM file and GTF one to find SNPs located in genes. But this takes a huge amount of time. Does somebody know if there exists a standard tool that solves my problem in an optimal time-efficient way?

Thanks!

snp vcf • 669 views
ADD COMMENTlink modified 2.2 years ago by WouterDeCoster37k • written 2.2 years ago by valerie60
0
gravatar for WouterDeCoster
2.2 years ago by
Belgium
WouterDeCoster37k wrote:

If you have a VCF file then you can easily use SnpEff, Annovar or VEP for annotation.

ADD COMMENTlink written 2.2 years ago by WouterDeCoster37k

Thank you for your advice! Actually my goal is to study genetic differences between two groups of mice (3 mice in each group). My VCF file contains information about all the samples. After converting to genotype matrix I selected only those SNPs that differ two groups. And now I want to understand what genes are involved in this differences. Is it possible to use the tools you mentioned to solve this problem?

ADD REPLYlink written 2.2 years ago by valerie60

Most convenient would probably be to filter your vcf now with what you found when selecting variants differing between the two groups.

ADD REPLYlink written 2.2 years ago by WouterDeCoster37k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2451 users visited in the last hour