How is vcf-merge handling different INFO for a single variant?
0
0
Entering edit mode
10 months ago
Cindy ▴ 10

I imputed several different datasets with the Michigan imputation server and downloaded the output dose.vcf.gz files. I then merged them together by chromosome. I have used both vcf-merge and bcftools merge. My question is, for the same variant, r-square imputation quality can be different between datasets; how is these merging commands dealing with that? I think this is important for downstream analysis. Thanks in advance!

RNA-Seq vcfools bcftools imputation • 347 views
ADD COMMENT
0
Entering edit mode

Please read the bcftools manual:

-i, --info-rules -|TAG:METHOD[,…] Rules for merging INFO fields (scalars or vectors) or - to disable the default rules. METHOD is one of sum, avg, min, max, join. Default is DP:sum,DP4:sum if these fields exist in the input files. Fields with no specified rule will take the value from the first input file. The merged QUAL value is currently set to the maximum. This behaviour is not user controllable at the moment.

ADD REPLY
0
Entering edit mode

Thank you! Exactly what I was looking for. I just want to confirm vcf-merge is following the same default, getting INFO from the first input file. I couldn't find it in the vcftools documentation.

ADD REPLY
0
Entering edit mode

AFAIK vcftools is obsolete, there's no feature that bcftools or other tools such as vt/GATK SelectVariants don't cover.

ADD REPLY

Login before adding your answer.

Traffic: 2616 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6