Question: Calculating Tajima's D for an individual genome vs. for population of genomes vs. for genes
gravatar for severalorks
14 months ago by
severalorks90 wrote:

I learned that Tajima's D compares the sum of nucleotide diversities for each pair of genomes to the expected nucleotide diversity of that population. After a selective sweep, the population is more homogeneous and so mutations that do occur are rare ones, meaning theta_T < theta_W and D<0. This made me think that one Tajima's D is calculated for an entire population.

However, I've also seen that you can calculate a Tajima's D for each individual in a population. My guess is that this individual statistic compares d_ij, where d is the pairwise difference b/w individuals i and j, to the population's Tajima's D?

Additionally, I've read that Tajima's D can be defined for individual genes, and also that: genes in the lower 5% tail of Tajima's D --> likely targets of selection. So how can Tajima's D be used to infer which genes are likely targets of selection? My guess is that Tajima's D is calculated on sliding windows along a genome.

ADD COMMENTlink modified 13 months ago by microfuge710 • written 14 months ago by severalorks90
gravatar for microfuge
13 months ago by
microfuge710 wrote:

I have never come across Tajima's D calculated for a single individual. Theta, Pi and Tajima's D are population level properties. 5% outlier cutoff is often used when 5% is of the distribution of Tajima's D for all the genes. Yes calculations are usually performed on the sliding windows or per gene for a population. I am quite happy with Variscan for doing this

ADD COMMENTlink written 13 months ago by microfuge710
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1483 users visited in the last hour