Forum: How to calculate Tajima's D and Fay & Wu's H for unphased data?
0
gravatar for RoseString
3.1 years ago by
RoseString0
Atlanta
RoseString0 wrote:

Hi,

I have a small number of samples (~10) for my species of interest (non-model organism), so it's almost impossible to phase the data. I am interested in doing some site-frequency spectrum methods to detect positive selection in the genome, but they require the calculation of nucleotide diversity (pi). Is it possible to do so without phasing the data?

Thanks in advance!

ADD COMMENTlink modified 3.1 years ago by jsgounot130 • written 3.1 years ago by RoseString0
0
gravatar for jsgounot
3.1 years ago by
jsgounot130
European Union
jsgounot130 wrote:

Maybe you could use VariScan. However, I don't know if it's the best way to do it for unphased data since you will have to produce 2 sequences for each individual, and therefore randomly assign each variant to one sequence.

ADD COMMENTlink written 3.1 years ago by jsgounot130

Thanks!

Do you know any literature doing the random assignment of variants if the data is unphased?

ADD REPLYlink written 3.1 years ago by RoseString0

Just an update. I found a study using your method. They call this process 'haploidize data'.

ADD REPLYlink written 3.1 years ago by RoseString0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 827 users visited in the last hour