Help understanding this per base sequence content failure fastqc plot
1
0
Entering edit mode
9 weeks ago
curious ▴ 900

I have this fastqc result from short read WGS data:

enter image description here

I read this post , which suggests that the variation in the first few bases is a known artefact of library prep, so hopefully that bit does not produce concern for my analysis, but what is the variation that starts appearing around base 150 and beyond? Is that concerning? my reads are 150 bp PE. Thanks!

fastqc • 2.5k views
ADD COMMENT
2
Entering edit mode
9 weeks ago

that 151st base added is a base to ignore (it's a kind of random addition) ... if I recall correctly it's a consequence of the technology. There is a post about that as well but I can't immediately find it back.

UPDATE: here is some info in this thread Why does Illumina have the extra +1 cycle? (but that is not the one I was thinking of though)

UPDATE2: to really answer your question: no, not a concern at all. Just clip it off if it's there.

ADD COMMENT
0
Entering edit mode

Make this an answer rather than comment?

ADD REPLY

Login before adding your answer.

Traffic: 4491 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6