Can MarkDuplicates of Picard be used for RNA reads?
1
0
Entering edit mode
20 months ago
Nemo • 0

Hello,

I have BAM files from RNA-seq data and am following the GATK pipeline for variant calling in RNA-seq. The second step runs Picard's MarkDuplicates, and I am unsure whether that tool is meant only for DNA or also applies to RNA. The MarkDuplicates (Picard) documentation contains this sentence:

This tool locates and tags duplicate reads in a BAM or SAM file, where duplicate reads are defined as originating from a single fragment of DNA.

After reading this, I am not sure: should I use it in an RNA-seq pipeline?

variants MarkDuplicates rna picard • 671 views
20 months ago
LChart 3.9k

Yes, you can use it in RNA-seq. How useful it is will depend on the library preparation method, however. When fragmentation happens before amplification, MarkDuplicates (or an equivalent positional deduplication program) can, and likely should, be used. When fragmentation happens after amplification, the same parent molecule can give rise to arbitrary sub-sequences, so its copies no longer share alignment positions; in this case unique molecular identifiers (UMIs) should be used for deduplication.
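To make the decision above concrete, here is a minimal Python sketch that only builds the command line for the deduplication step; it does not run anything. The helper name `dedup_command`, its parameters, and the output file naming are hypothetical. It assumes GATK4's bundled `MarkDuplicates` for the positional case and `umi_tools dedup` for the UMI case, with UMIs already placed in the read names (for example by `umi_tools extract`). Paired-end data or UMIs stored in a BAM tag would need extra options not shown here.

```python
from typing import List


def dedup_command(bam: str,
                  fragmentation_before_amplification: bool,
                  has_umis: bool = False) -> List[str]:
    """Build an argv list for a deduplication step (hypothetical helper).

    Output names are derived from the input name; this naming is an
    assumption made for illustration only.
    """
    stem = bam[:-4] if bam.endswith(".bam") else bam

    if fragmentation_before_amplification:
        # PCR copies of a fragment share alignment coordinates, so
        # positional duplicate marking is appropriate.
        return [
            "gatk", "MarkDuplicates",
            "-I", bam,
            "-O", stem + ".markdup.bam",
            "-M", stem + ".markdup_metrics.txt",
        ]

    if has_umis:
        # Fragmentation after amplification: rely on UMIs instead of
        # position alone. Assumes UMIs are in the read names.
        return [
            "umi_tools", "dedup",
            "-I", bam,
            "-S", stem + ".dedup.bam",
        ]

    # Fragmentation after amplification and no UMIs: neither approach
    # described above applies cleanly, so refuse rather than guess.
    raise ValueError(
        "fragmentation after amplification without UMIs: "
        "positional deduplication is not reliable here"
    )


if __name__ == "__main__":
    print(" ".join(dedup_command("sample.bam", True)))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` once the tools are installed. Marking duplicates (rather than removing them) keeps the reads in the BAM, so downstream tools can decide whether to ignore them.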
