I have many VCFs from different samples. If I merge these into a single VCF using vcftools (vcf-merge), the samples where a variant wasn't called are labeled as missing that variant. Instead, I want the VCF to show that the sample has the reference allele (safe to assume in my application).
Is there a way to call missing variants in a VCF as the reference allele? What tools can I use to do this?
EDIT:
The sequences were originally variant called using FreeBayes (through the LongRanger pipeline).
RE-EDIT:
Turns out I can simply use the --ref-for-missing flag in vcf-merge to achieve this. Problem solved.
RE-RE-EDIT:
Using --ref-for-missing flag in vcf-merge does of course not give the variants any annotation, like depth and genotype quality.
I think
vcf-mergewith the--ref-for-missingflag should solve the problem yourVcfNoCallToHomRefsolves. YourFixVcfMissingGenotypestool sounds really useful though.I didn't try your solution because annotating the depth only isn't enough for me. I accepted the answer anyway, as it solves the problem I stated.