Entering edit mode
13 months ago
Jack
•
0
Hi, I am a MSc student struggling with what statistics I can do on my allele counts array generated by simulation with each set having a population from selection (soft sweep) and another from neutral selection. I wanted to do Tajima's D with scikit allele but this requires the base positions at the same location in the genome between both but my files have very little overlap in sites. Should I calculate the statistic between sites within the same window region of 100 Mb even if they don't line up or is there something else I can do? Thanks in advance, this means a lot :)