Peaks in GC% histogram
1
0
Entering edit mode
7.1 years ago
wengkhong ▴ 10

Hi folks,

I am doing QC on a batch of ribo-depleted RNA-seq samples (human) and am seeing something strange in the GC content histogram. There appears to be spikes interspersed in the GC% histogram. I thought this could be due to un-depleted rRNA sequences, but I have tried to remove reads mapping to human rRNA sequences, and this artefact remains. Has anyone else seen something like this?enter image description here

RNA-Seq • 2.1k views
ADD COMMENT
0
Entering edit mode

It's clear from the histogram that a duplication was occurred, but the significance of this observation is correlate with data.Can you add details, length of sequence an the method used: as a stream or with fixed windows.

ADD REPLY
2
Entering edit mode
7.1 years ago

Given the regular spacing of these spikes, it looks to me like it is an algorithmic artifact caused by the read length and number of bins. Try changing the number of bins to exactly the read length.

ADD COMMENT
1
Entering edit mode

You're right. It was some issue with the binning. I plotted the underlying data, which was binned at exactly the read length and it looked perfectly normal. Thanks!

ADD REPLY

Login before adding your answer.

Traffic: 2452 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6