Question: Calculating Tajima's D for an individual genome vs. for population of genomes vs. for genes
gravatar for severalorks
4.7 years ago by
severalorks110 wrote:

I learned that Tajima's D compares the sum of nucleotide diversities for each pair of genomes to the expected nucleotide diversity of that population. After a selective sweep, the population is more homogeneous and so mutations that do occur are rare ones, meaning theta_T < theta_W and D<0. This made me think that one Tajima's D is calculated for an entire population.

However, I've also seen that you can calculate a Tajima's D for each individual in a population. My guess is that this individual statistic compares d_ij, where d is the pairwise difference b/w individuals i and j, to the population's Tajima's D?

Additionally, I've read that Tajima's D can be defined for individual genes, and also that: genes in the lower 5% tail of Tajima's D --> likely targets of selection. So how can Tajima's D be used to infer which genes are likely targets of selection? My guess is that Tajima's D is calculated on sliding windows along a genome.

ADD COMMENTlink modified 4.6 years ago by microfuge1.9k • written 4.7 years ago by severalorks110
gravatar for microfuge
4.6 years ago by
microfuge1.9k wrote:

I have never come across Tajima's D calculated for a single individual. Theta, Pi and Tajima's D are population level properties. 5% outlier cutoff is often used when 5% is of the distribution of Tajima's D for all the genes. Yes calculations are usually performed on the sliding windows or per gene for a population. I am quite happy with Variscan for doing this

ADD COMMENTlink written 4.6 years ago by microfuge1.9k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2100 users visited in the last hour