Question

Read group addition order in RNA-seq short variant discovery (SNPs + Indels)

0

Entering edit mode

6 hours ago

iamsmor • 0

Hello, I’m following the GATK best practices for RNA-seq short variant discovery (SNPs + Indels) and wondering about the correct point to add Read Groups (RGs). Does the order (before/after MarkDuplicates or SplitNCigarReads) matter for RNA-seq variant calling with GATK (HaplotypeCaller)? Any official clarification or reference from the GATK team or papers? Pipeline: HISAT2, AddOrReplaceReadGroups, MarkDuplicates, SplitNCigarReads, BaseRecalibrator, HaplotypeCaller Thank you for any kind of help

rnaseq variantcalling addreadgroups gatk • 96 views

ADD COMMENT • link updated 4 hours ago by GenoMax 154k • written 6 hours ago by iamsmor • 0

0

Entering edit mode

Ideally you would do this at the beginning of the process as you align the data. Read groups would be tagging the samples with relevant info e.g. what lane they were in, if they ran on multiple flowcells etc (all of this may not be applicable in your case).

See Expected Input section: https://gatk.broadinstitute.org/hc/en-us/articles/360035535912-Data-pre-processing-for-variant-discovery

ADD REPLY • link 5 hours ago by GenoMax 154k

score 0 · Answer 1 · 2025-10-28

0

Entering edit mode

5 hours ago

iamsmor • 0

ı didnot do when ı align rnaseq datasets but I did it as soon as I realized I had to do it. ı am following https://gatk.broadinstitute.org/hc/en-us/articles/360035531192-RNAseq-short-variant-discovery-SNPs-Indels but here is not exact information.