Question: vcftools nucleotide diversity statistic (pi)
1
gravatar for modi2020
12 months ago by
modi202030
United States
modi202030 wrote:

Dear fellows:

I know that Nei's Pi (nucleotide diversity statistic) is calculated per site using sequences belonging to more than one individuals. However, in vcftools, it seems like they can be calculated on a per individual bases (https://sourceforge.net/p/vcftools/mailman/message/29923488/). In another post by vcftools, it was defined as "the average number of differences between a pair of chromosomes" (https://sourceforge.net/p/vcftools/bugs/42/) which means to me that only one individual is enough for its calculation. We plan to use this method in a paper we are about to write since we don't have more than one individual per breed. Does anyone know the formula used in vcftools or if it is at all valid to calculate Pi in this way?

I would really appreciate your kind advice on this.

vcftools • 1.6k views
ADD COMMENTlink modified 12 months ago by caw5cv10 • written 12 months ago by modi202030
1
gravatar for caw5cv
12 months ago by
caw5cv10
caw5cv10 wrote:

Nucleotide diversity is a population-level metric, the average number of differences between a pair of chromosomes, across all chromosome combinations within the population. If you only have one individual, maybe you're looking to report heterozygosity instead?

ADD COMMENTlink written 12 months ago by caw5cv10

Thank you so much caw5cv. I realize that its a population level metric. However, it seems like in vcftools its calculated on a per individual basis for SNPs in non-overlapping windows. Is that still useful if one had SNPs called on a single genome.

ADD REPLYlink written 12 months ago by modi202030
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2266 users visited in the last hour