Question: GC content in bilsulfite converted library
gravatar for PoGibas
6.2 years ago by
PoGibas4.8k wrote:

I am confused about bisulfite converted library GC content.

Fastqc per base sequence content looks like this:

(%G has decreased and %A has increased, compared to reference genome).
1. Shouldn't %C have decreased instead of %G?

Bismark reports that >90% C's in CHG and CHH were methylated, however people from the wet lab say that in this organism only CpG methylation is possible.

2. Seeing such result (methylation in CHH and CHG) can we speculate that something went bad with bisulfite conversion?

3. Bismark/BS Seeker2 maps only those reads that have non-converted Cs (this is way we get high CH methylation percentage). What can be the reason that reads with converted Cs don't map?  

bisulfite • 2.6k views
ADD COMMENTlink modified 6.2 years ago • written 6.2 years ago by PoGibas4.8k

It looks as if reads have been (reverse) complemented.

ADD REPLYlink written 6.2 years ago by dariober11k

This is what we think too. If this is the data we got (Ion torrent) is it possible that something got messed up in base calling stage?

ADD REPLYlink written 6.2 years ago by PoGibas4.8k

I have no experience with Ion Torrent but I don't see why base calling should complemented. Are you sure this is a "standard" bisulfite library?

ADD REPLYlink written 6.2 years ago by dariober11k
gravatar for Devon Ryan
6.2 years ago by
Devon Ryan98k
Freiburg, Germany
Devon Ryan98k wrote:

That's really really strange. In all of my datasets the C percentage falls toward 0, causing T to jump to near 50%. Is this some sort of targeted BS-seq dataset? Did you run other samples at the same time and did they produce similar results?

In general, the C and T percentages should be pushed away from 25% by the bisulfite conversion and G and A percentages should still be around 25%. Not seeing that (and also seeing >90% CHH methylation when that's not expected) suggests pretty strongly to me that something either went very wrong during bisulfite conversion or the reads were treated in a very strange way prior to running fastQC. If you can confirm that no one monkeyed with the reads then I would suggest being very hesitant in trusting this dataset.

ADD COMMENTlink written 6.2 years ago by Devon Ryan98k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2293 users visited in the last hour