High Sequence Duplication levels in FastQC
1
3
Entering edit mode
8.8 years ago
EVR ▴ 610

Hi,

I am new to NGS analysis. I have RNAseq samples generated from Illumina platform. When the samples loaded onto FastQC , I noticed that there is high percentage Sequence Duplication level (> 70%) for all samples but the per base sequence quality for all samples are really good. How should I approach with these data? Is this normal for Illumina samples? Any guidance would be highly appreciated.

Thanks in advance

RNA-Seq FastQC Sequence_Duplication • 11k views
ADD COMMENT
9
Entering edit mode
8.8 years ago
Sam ★ 4.7k

For RNA Seq, it is normal to see a high percentage of sequence duplication level. That is because the inherit nature of the RNA Sequencing is to count the depth of coverage of each gene. If you have a high coverage gene, it is likely you will encounter duplication.

As for exome sequencing, you would prefer a uniform coverage, therefore duplication might be a problem

Edit: Actually someone else has asked a similar question before High Duplication Rate of Mapped In RNA-seq

ADD COMMENT

Login before adding your answer.

Traffic: 2004 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6