How is vcf-merge handling different INFO for a single variant?

0

Entering edit mode

3.9 years ago

Cindy ▴ 40

I imputed several different datasets with the Michigan imputation server and downloaded the output dose.vcf.gz files. I then merged them together by chromosome. I have used both vcf-merge and bcftools merge. My question is, for the same variant, r-square imputation quality can be different between datasets; how is these merging commands dealing with that? I think this is important for downstream analysis. Thanks in advance!

RNA-Seq vcfools bcftools imputation • 1.3k views

ADD COMMENT • link 3.9 years ago by Cindy ▴ 40

0

Entering edit mode

Please read the bcftools manual:

-i, --info-rules -|TAG:METHOD[,…] Rules for merging INFO fields (scalars or vectors) or - to disable the default rules. METHOD is one of sum, avg, min, max, join. Default is DP:sum,DP4:sum if these fields exist in the input files. Fields with no specified rule will take the value from the first input file. The merged QUAL value is currently set to the maximum. This behaviour is not user controllable at the moment.

ADD REPLY • link 3.9 years ago by Ram 43k

0

Entering edit mode

Thank you! Exactly what I was looking for. I just want to confirm vcf-merge is following the same default, getting INFO from the first input file. I couldn't find it in the vcftools documentation.

ADD REPLY • link 3.9 years ago by Cindy ▴ 40

0

Entering edit mode

AFAIK vcftools is obsolete, there's no feature that bcftools or other tools such as vt/GATK SelectVariants don't cover.

ADD REPLY • link 3.9 years ago by Ram 43k

Login before adding your answer.