vcftools --TajimaD does not start at 0 and misses variants
1
0
Entering edit mode
3.1 years ago
grey ▴ 30

I have used vcftools --TajimaD to calculate Tajima's D statistics with other species easily in the past. However, I am running this now on a new vcf and getting strange results:

  1. the output file doesn't start at 0, but rather at 539700. There is nothing special about this number. The first variant in the VCF file is at position 36 and so forth. See screenshot below enter image description here enter image description here
  2. also, even for windows where it does make some calculations, it is missing variants that are clearly present in the VCF.

enter image description here

In case its helpful here is vcftools output enter image description here

Indexing the vcf (tabix) did not solve.

Anyone encountered this before?

vcftools tajimaD • 2.1k views
ADD COMMENT
1
Entering edit mode
3.1 years ago
grey ▴ 30

SOLVED:

in the vcf-merge command used to create the vcf file I added --ref-for-missing 0/0

Perhaps vcftools --TajimaD doesn't like having missing values (ie ./.)?

ADD COMMENT
0
Entering edit mode

Please accept your answer using the check mark on the left to mark the post as solved.

ADD REPLY
0
Entering edit mode

it gives error "you can not accept your own post"

ADD REPLY
0
Entering edit mode

That's odd. I think it could be a bug with the new site. I'll accept your answer for you and alert the admin to this bug.

ADD REPLY

Login before adding your answer.

Traffic: 1544 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6