Question: VCFtools: --window-pi with --from-bp and --to-bp produces windows with different start and end positions
Asked 3.4 years ago by GabrielMontenegro (United Kingdom):

I am trying to estimate pi for a given region, so I used the --from-bp and --to-bp options to specify it:

THE LOG FILE

Parameters as interpreted:
--vcf file
--chr 5
--to-bp 33976693
--out test
--window-pi 50000
--from-bp 33926693

After filtering, kept 103 out of 103 Individuals
Outputting Windowed Nucleotide Diversity Statistics...
After filtering, kept 1482 out of a possible 5033226 Sites
Run Time = 33.00 seconds

The specified region spans 50000 bp (33926693 to 33976693), so I used the --window-pi 50000 option to get the nucleotide diversity of the whole region as a single window. However, the output is split across two windows:

RESULT

CHROM BIN_START BIN_END N_VARIANTS PI
5 33900001 33950000 108 0.000323596
5 33950001 34000000 113 0.000431332

My region lies within those two windows, but this is not what I wanted: I expected a single window running from 33926693 to 33976693. Also, the N_VARIANTS values (108 + 113 = 221) do not add up to the 1482 kept sites reported in the log file.
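The window boundaries in the output above are consistent with windows laid out on a fixed grid that starts at position 1 of the chromosome, rather than at the --from-bp coordinate. The following is a minimal sketch of that binning rule, inferred only from the BIN_START and BIN_END values shown here; the function names are hypothetical and this is not taken from the VCFtools source.

```python
def window_for(pos, window_size):
    """Return (bin_start, bin_end) of the fixed-grid window containing a 1-based position.

    Assumption: windows are anchored at position 1, so they cover
    1..W, W+1..2W, and so on, regardless of any region filter.
    """
    bin_start = ((pos - 1) // window_size) * window_size + 1
    return bin_start, bin_start + window_size - 1


def windows_for_region(from_bp, to_bp, window_size):
    """List every fixed-grid window that overlaps the region from_bp..to_bp."""
    first_start, _ = window_for(from_bp, window_size)
    last_start, _ = window_for(to_bp, window_size)
    return [
        (start, start + window_size - 1)
        for start in range(first_start, last_start + 1, window_size)
    ]


# Region and window size from the log file above.
print(windows_for_region(33926693, 33976693, 50000))
# prints [(33900001, 33950000), (33950001, 34000000)]
```

Under this assumption, a region that does not start on a grid boundary always overlaps at least two windows of its own length, which matches the two rows reported.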

snp genome vcf • 2.7k views

Hello Sir, I am facing the same problem. I managed to get one whole region reported as a single window in the Tajima's D calculation, but I am unable to do the same for the pi calculation.

Did you find a way to overcome this problem? If yes, can you please suggest what to do?

Thanking you, Amit
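One possible workaround, sketched here under stated assumptions rather than as a confirmed answer from this thread: VCFtools also has a --site-pi option that reports nucleotide diversity per site, and those values can be summed over an arbitrary region and divided by the region length. The sketch below assumes the per-site output is whitespace-delimited with the columns CHROM, POS, PI; the function name region_pi and the toy input values are hypothetical. The result is an approximation and may differ from the value --window-pi would report, particularly when genotypes are missing.

```python
import math


def region_pi(lines, chrom, from_bp, to_bp):
    """Approximate pi for chrom:from_bp-to_bp from per-site pi records.

    lines: iterable of text lines with assumed columns CHROM, POS, PI.
    Sites absent from the input are treated as invariant (pi = 0).
    """
    total = 0.0
    for line in lines:
        fields = line.split()
        # Skip blank lines, short lines, and the header row.
        if len(fields) < 3 or fields[0] == "CHROM":
            continue
        pos = int(fields[1])
        site_pi = float(fields[2])
        if math.isnan(site_pi):
            continue  # ignore sites where pi could not be computed
        if fields[0] == chrom and from_bp <= pos <= to_bp:
            total += site_pi
    # Inclusive region length in base pairs.
    return total / (to_bp - from_bp + 1)


# Toy records for illustration only, not real data.
toy = [
    "CHROM\tPOS\tPI",
    "5\t10\t0.5",
    "5\t20\t0.25",
    "5\t200\t0.1",
    "6\t15\t0.9",
]
print(region_pi(toy, "5", 1, 100))
```

With a real per-site file, the lines argument can simply be the open file handle, and from_bp and to_bp would be the same coordinates passed to --from-bp and --to-bp.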

Comment written 2.2 years ago by amitgourav.ghosh1260