Why the number of reads in bam generated by GATK haplotype caller are more than the bam generated after GATK baserecalibrator
0
0
Entering edit mode
19 months ago
pragnapcu ▴ 10

I am wondering how the number of reads are more in haplotypecaller bam than in baserecalibrator bam though they dont appear to be duplicates?

BaseRecalibrator BAM GATK READ HaplotypeCaller • 1.2k views
ADD COMMENT
0
Entering edit mode

show us the data please.

ADD REPLY
0
Entering edit mode

enter image description here

As per explanation given here https://gatk.broadinstitute.org/hc/en-us/articles/360040096812-HaplotypeCaller#--bam-output , I noticed two categories of reads in the bam generated from GATK HaplotypeCaller. One set of reads start with HC and another set has original read name.

Can Someone help me in better understanding this scenario.

  1. There are some reads (upper segment; lower reads in pink) which do not have insertion or missense. Their readname is different. What exactly do the reads represent?
  2. Not all haplotype (HC) reads have the insertion. Does that mean it is heterozygous variant?
ADD REPLY

Login before adding your answer.

Traffic: 1901 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6