Question

Read group addition order in RNA-seq short variant discovery (SNPs + Indels)

0

Entering edit mode

8 days ago

iamsmor • 0

Hello, I’m following the GATK best practices for RNA-seq short variant discovery (SNPs + Indels) and wondering about the correct point to add Read Groups (RGs). Does the order (before/after MarkDuplicates or SplitNCigarReads) matter for RNA-seq variant calling with GATK (HaplotypeCaller)? Any official clarification or reference from the GATK team or papers? Pipeline: HISAT2, AddOrReplaceReadGroups, MarkDuplicates, SplitNCigarReads, BaseRecalibrator, HaplotypeCaller Thank you for any kind of help

rnaseq variantcalling addreadgroups gatk • 645 views

ADD COMMENT • link updated 8 days ago by GenoMax 154k • written 8 days ago by iamsmor • 0

score 1 · Answer 1 · 2025-10-28

1

Entering edit mode

8 days ago

GenoMax 154k

Ideally you would do this at the beginning of the process as you align the data. Read groups would be tagging the samples with relevant info e.g. what lane they were in, if they ran on multiple flowcells etc (all of this may not be applicable in your case).

See Expected Input section: https://gatk.broadinstitute.org/hc/en-us/articles/360035535912-Data-pre-processing-for-variant-discovery

ADD COMMENT • link 8 days ago by GenoMax 154k

0

Entering edit mode

ı didnot do when ı align rnaseq datasets but I did it as soon as I realized I had to do it. ı am following https://gatk.broadinstitute.org/hc/en-us/articles/360035531192-RNAseq-short-variant-discovery-SNPs-Indels but here is not exact information.

ADD REPLY • link 8 days ago by iamsmor • 0

1

Entering edit mode

Do it before you start doing any manipulations after alignments.

ADD REPLY • link 7 days ago by GenoMax 154k

0

Entering edit mode

yes this is exactly what ı have done but anway ı couldnot find any clear reference for exact pipeline .so actually ı write here maybe someone can help for reference , because nothing clear when ı search on google. thank you for help, ı also feel ı am right but ı amnot sure anyway without papers.

ADD REPLY • link 7 days ago by iamsmor • 0

2

Entering edit mode

https://gatk.broadinstitute.org/hc/en-us/articles/360035531192-RNAseq-short-variant-discovery-SNPs-Indels references the https://gatk.broadinstitute.org/hc/en-us/articles/360035535912-Data-pre-processing-for-variant-discovery pipeline linked above. In RNAseq pipeline STAR is replacing default aligner (to account for splicing) so the rest of the steps from there on should be as per the input requirements for DNA pipeline.

We use MergeBamAlignment and MarkDuplicates (similarly to our DNA pre-processing best practices pipeline)

ADD REPLY • link 7 days ago by GenoMax 154k

0

Entering edit mode

thank you very much. actually ı follow this link's pipeline also but should read more carefully for meaning . thanks alot Genomax .

ADD REPLY • link 6 days ago by iamsmor • 0