Question: Merging BAM or VCF files, what's the difference?
8 weeks ago, leonardo.rippel wrote:

Hello everybody.

I have two main sets of files to analyze; the end goal is SNP discovery and haplotyping for later imputation.

One dataset is haploid, the other is diploid. I will investigate each separately.

For each dataset I have shallow-sequenced DNA from 26 individual trees.

My question is: is there any practical difference between calling variants for each individual and then merging the results into one big VCF, versus merging all alignments into one big .bam (with an RG for each individual) and then calling variants once?

As far as I can see, the differences between the individual calls would be reconciled by the mergevcf tool, no?

Thanks in advance.
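
To make the question concrete, here is a minimal Python sketch (a toy, not a real VCF parser) of what a naive merge of per-sample call sets does. It assumes that a sample with no record at a site ends up with a missing genotype ("./."), because a variants-only VCF cannot say whether that sample was homozygous reference or simply had no reads there. Whether mergevcf behaves exactly this way is something to check against its documentation. The sample names, sites and genotypes are made up for illustration.

```python
# Toy illustration only: genotypes are plain strings, not real VCF records.
# Sample names, sites and genotypes below are hypothetical.

MISSING = "./."


def merge_per_sample_calls(per_sample_calls):
    """Merge per-sample call sets into one site-by-sample table.

    per_sample_calls maps sample name -> {(chrom, pos): genotype}.
    A sample without a record at a site is filled with MISSING, since a
    variants-only call set does not tell us if it was hom-ref or uncovered.
    """
    samples = sorted(per_sample_calls)
    all_sites = sorted(
        {site for calls in per_sample_calls.values() for site in calls}
    )
    return {
        site: {s: per_sample_calls[s].get(site, MISSING) for s in samples}
        for site in all_sites
    }


def count_missing(merged):
    """Count genotype entries that had to be filled in as missing."""
    return sum(
        1 for genotypes in merged.values() for gt in genotypes.values()
        if gt == MISSING
    )


calls = {
    "tree_A": {("chr1", 100): "0/1"},
    "tree_B": {("chr1", 250): "1/1"},
    "tree_C": {("chr1", 100): "1/1", ("chr1", 250): "0/1"},
}

merged = merge_per_sample_calls(calls)
for site, genotypes in merged.items():
    print(site, genotypes)
print("missing genotypes:", count_missing(merged))
```

With shallow coverage, many of those "./." entries are likely to be sites that were never covered at all. That distinction is what a joint call keeps and a merge of variants-only VCFs loses.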


In my experience, I've always kept a separate BAM file per sample and performed joint genotyping with GATK for this task! I think it's safer for later purposes.

written 8 weeks ago by brunobsouzaa
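
For readers unfamiliar with the approach mentioned above, here is a rough sketch of one common way to do per-sample BAMs plus joint genotyping with GATK4: per-sample gVCFs from HaplotypeCaller, then CombineGVCFs, then GenotypeGVCFs. This is not necessarily the commenter's exact pipeline. The Python below only builds the command lines and does not run them. The file names, the helper name and the cohort output names are hypothetical, and the flags should be checked against the GATK version installed.

```python
import shlex


def gvcf_joint_genotyping_commands(bams, reference, ploidy=2):
    """Build (but do not run) GATK4 commands for a gVCF-based joint call.

    bams maps sample name -> BAM path (hypothetical paths).
    ploidy is passed to HaplotypeCaller, e.g. 1 for a haploid dataset.
    """
    commands = []
    gvcfs = []
    # Step 1: one gVCF per sample, keeping reference-confidence blocks.
    for sample, bam in sorted(bams.items()):
        gvcf = f"{sample}.g.vcf.gz"
        gvcfs.append(gvcf)
        commands.append([
            "gatk", "HaplotypeCaller",
            "-R", reference, "-I", bam, "-O", gvcf,
            "-ERC", "GVCF", "--sample-ploidy", str(ploidy),
        ])
    # Step 2: combine the per-sample gVCFs into one cohort gVCF.
    combine = ["gatk", "CombineGVCFs", "-R", reference]
    for gvcf in gvcfs:
        combine += ["--variant", gvcf]
    combine += ["-O", "cohort.g.vcf.gz"]
    commands.append(combine)
    # Step 3: joint genotyping across all samples.
    commands.append([
        "gatk", "GenotypeGVCFs",
        "-R", reference, "-V", "cohort.g.vcf.gz", "-O", "cohort.vcf.gz",
    ])
    return commands


example = gvcf_joint_genotyping_commands(
    {"tree_A": "tree_A.bam", "tree_B": "tree_B.bam"}, "ref.fa", ploidy=1
)
for cmd in example:
    print(shlex.join(cmd))
```

Building the commands as lists keeps the haploid and diploid datasets on the same code path with only the `ploidy` argument changing. Each list can be passed to `subprocess.run` once the paths are real.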

So, that means obtaining a VCF for each sample and then performing the joint genotyping with GATK. The thing is, I'm not using GATK. I'm analysing the variants with DeepVariant, and I'm not sure whether it has the same option. However, I will keep your solution in mind.

written 8 weeks ago by leonardo.rippel