Question: vcftools nucleotide diversity statistic (pi)
2
gravatar for modi2020
2.4 years ago by
modi202040
United States
modi202040 wrote:

Dear fellows:

I know that Nei's Pi (nucleotide diversity statistic) is calculated per site using sequences belonging to more than one individuals. However, in vcftools, it seems like they can be calculated on a per individual bases (https://sourceforge.net/p/vcftools/mailman/message/29923488/). In another post by vcftools, it was defined as "the average number of differences between a pair of chromosomes" (https://sourceforge.net/p/vcftools/bugs/42/) which means to me that only one individual is enough for its calculation. We plan to use this method in a paper we are about to write since we don't have more than one individual per breed. Does anyone know the formula used in vcftools or if it is at all valid to calculate Pi in this way?

I would really appreciate your kind advice on this.

vcftools • 4.1k views
ADD COMMENTlink modified 2.4 years ago by caw5cv20 • written 2.4 years ago by modi202040
2
gravatar for caw5cv
2.4 years ago by
caw5cv20
caw5cv20 wrote:

Nucleotide diversity is a population-level metric, the average number of differences between a pair of chromosomes, across all chromosome combinations within the population. If you only have one individual, maybe you're looking to report heterozygosity instead?

ADD COMMENTlink written 2.4 years ago by caw5cv20

Thank you so much caw5cv. I realize that its a population level metric. However, it seems like in vcftools its calculated on a per individual basis for SNPs in non-overlapping windows. Is that still useful if one had SNPs called on a single genome.

ADD REPLYlink written 2.4 years ago by modi202040
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1439 users visited in the last hour