Question: FastQC low %A in per base sequence content
0
gravatar for Duff
22 months ago by
Duff660
United Kingdom
Duff660 wrote:

Hi

I have some Illumina RNA seq data, 75bp paired end reads. I've run FastQC & trimmomatic and then re-run FastQC. Both before and after trimmomatic quite a few of my sequences fail the per base sequence content with the %A content running low in read 1 (e.g. http://i.imgur.com/Cv4HXJq.png) and the %T running low in read 2 (e.g. http://i.imgur.com/ul7v55H.png). Read 2 tends to be a warning rather than a fail.

Per base quality is good across the board, per sequence GC content looks fine and the only other fails I get are duplicated sequences and kmer content (both of which I gather are common fails in mRNA seq data).

I've little experience with RNA seq QC - is this something to be concerned about?

Thanks for any advice.

Best,

Iain

fastqc rna-seq • 1.1k views
ADD COMMENTlink modified 22 months ago by harold.smith.tarheel4.3k • written 22 months ago by Duff660
2
gravatar for harold.smith.tarheel
22 months ago by
United States
harold.smith.tarheel4.3k wrote:

This is the expected result if 1) your libraries are strand-specific and 2) they contain some poly(A)+ contamination. You should also observe oligo-T among the over-represented k-mer content.

ADD COMMENTlink written 22 months ago by harold.smith.tarheel4.3k

Ah - strand specific. So obvious now you've pointed that out. Thanks. Can't see any poly A or oligo T in the kmers.

ADD REPLYlink written 22 months ago by Duff660

If this answer was helpful you should upvote it, if the answer resolved your question you should mark it as accepted. This helps future readers of your question to evaluate the response.

ADD REPLYlink written 22 months ago by harold.smith.tarheel4.3k

I was looking at a fungal genome once that seemed to consist of only 3 letters on each strand. IIRC, one strand was only A, C, and G and the other was only C, G, and T (at least, in the areas I looked at). So if as Harold suggests your data is strand-specific, that particular organism might show this kind of pattern, if one of the two strands was generally coding and the other wasn't.

ADD REPLYlink written 22 months ago by Brian Bushnell16k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 781 users visited in the last hour