Question: CG content changed after trimming the adaptors
gravatar for kintany
6 days ago by
kintany10 wrote:

Hi all, I'm analyzing RNA-seq data, paired-end, coming from Nextera. I ran Fastqc to look at the quality and got the following report: here. So I was concerned with the amount of Nextera transposase left, so I ran trim_galore to trim it. But after trimming the report looks like this: here. The transposase is gone but now the GC content plot looks super weird. Any thoughts? Thank you.

fastqc rna-seq • 95 views
ADD COMMENTlink modified 6 days ago • written 6 days ago by kintany10

You may not have shared the files correctly since I can't see them in chrome.

Without seeing your data I am going to speculate that you may be encountering a well known observation of seeing a positional bias in first 10-15 bp of RNAseq datasets prepped with transposases based kits. In that case, you will find this blog post informative.

If this is not applicable then I suggest that you post screenshots of plots from FastQC using these directions: How to add images to a Biostars post

ADD REPLYlink modified 6 days ago • written 6 days ago by genomax62k

Thank you. I changed the format to pdf, you should the files now.

ADD REPLYlink written 6 days ago by kintany10

As expected that looked like a normal RNAseq dataset. Have you checked to see if you have rRNA contamination in your data? That may sometimes show up as the shouldered GC peak. As you did not have obvious adapter contamination in your data you could use the original data for alignment and let the aligner soft clip any parts that don't align.

ADD REPLYlink written 6 days ago by genomax62k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 634 users visited in the last hour