Question: Genetic measures calculation in vcf files
3.4 years ago by
Brazil/Fiocruz/Minas Gerais
ricardo40 wrote:


I have a set of files in VCF format, obtained by mapping reads against a reference genome. After mapping and SNP calling, the SNPs were filtered down to those falling in a list of 40 genes. This was done for 18 different genomes, so I have 18 VCF files containing the variants found in these genes of interest.

I would like to know which tools I could use to calculate measures of genetic variability such as pi, Tajima's D and others. I want to compare these values, since the data come from various locations around the world.

I've tried using vcftools, but the results were either inconsistent or came back as invalid values (nan).

tajimad pi vcf • 1.2k views
3.3 years ago by
United Kingdom
willgilks300 wrote:

Hi Ricardo,

I think you're getting values of NaN because you are only analysing one individual. Population genetics requires more than one individual, of course.

Combine the separate VCF files from each individual into a single multi-sample VCF, then run the analysis on that.

GATK is good for this merging step.
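
To see why a single individual gives NaN, here is a minimal sketch of the standard Tajima's D formula in plain Python. The function name `tajimas_d` and its inputs are made up for illustration; this is not how vcftools is implemented. It assumes biallelic SNPs, no missing genotypes, and that you have already counted the alternate allele at each site across `n` sampled chromosomes (2 per diploid individual).

```python
import math

def tajimas_d(alt_counts, n):
    """Return (pi, theta_w, D) for n sampled chromosomes.

    alt_counts: alternate-allele count (0..n) at each site (hypothetical input).
    pi is the summed pairwise diversity over all sites, not a per-site value.
    """
    nan = float("nan")
    if n < 2:
        return nan, nan, nan

    # Only segregating sites (both alleles present) contribute.
    seg = [k for k in alt_counts if 0 < k < n]
    S = len(seg)

    # Mean number of pairwise differences, summed over sites.
    pi = sum(2.0 * k * (n - k) / (n * (n - 1)) for k in seg)

    # Constants from Tajima (1989).
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)

    theta_w = S / a1  # Watterson's estimator
    var = e1 * S + e2 * S * (S - 1)

    # The variance is zero when n == 2 or when S == 0, so D is undefined.
    d = (pi - theta_w) / math.sqrt(var) if var > 0 else nan
    return pi, theta_w, d

# One diploid individual = 2 chromosomes; every heterozygous site has count 1.
print(tajimas_d([1, 1, 1], 2))         # D is nan
# 18 diploid individuals merged = 36 chromosomes; D is a finite number.
print(tajimas_d([1, 5, 18, 2], 36))
```

With n = 2 the constants c1 and c2 are both zero, so the denominator of D vanishes and the result is undefined no matter how many SNPs the file contains. The same happens for any region with no segregating sites.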


