Hi, I was doing multiqc on raw rnaseq datasets, it is showing higher level of read duplication. Do I need to take any step for this datasets before processing for read quantification. I am using trimmommatic to remove low quality read and adapter sequences but not sure if I need to take any other steps. Multiqc file attached. Many thanks.
Thanks for your response. This is total RNAseq data.
Just try aligning the reads and see what happens — and you can check which transcripts are most abundant. You might also consider aligning to rRNA as well, then tossing out those reads, and running QC again.