Hello! I am encountering a strange problem. My FastQC graph looks like this: the quality scores increase toward the end of the reads, whereas we would expect to see a decrease at the end.
The FASTQ files were generated using CASAVA-1.8.0, so the format is supposed to be Sanger-encoded.
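One way to sanity-check the encoding assumption is to look at the lowest ASCII character in the quality lines. The sketch below is only a heuristic, not part of FastQC or CASAVA: the function names `guess_phred_offset` and `decode_quality` and the example quality string are made up for illustration. It assumes that characters below `;` (ASCII 59) can only appear in Sanger-style Phred+33 data.

```python
def guess_phred_offset(quality_strings):
    """Guess the ASCII offset (33 or 64) from raw FASTQ quality strings."""
    lowest = min(ord(ch) for q in quality_strings for ch in q)
    # Characters below ';' (ASCII 59) do not occur in the older +64 encodings.
    if lowest < 59:
        return 33
    # Otherwise +64 is likely, although uniformly high-quality +33 reads
    # would look the same, so treat this branch as a guess.
    return 64


def decode_quality(quality_string, offset=33):
    """Convert one quality string into a list of integer Phred scores."""
    return [ord(ch) - offset for ch in quality_string]


# Hypothetical quality string, not taken from the data in this thread.
example = "#1=DDFFFHHHHHJJJ"
offset = guess_phred_offset([example])
scores = decode_quality(example, offset)
print(offset, scores)
```

If the guessed offset is 33, the files are consistent with Sanger encoding, so a wrong offset is unlikely to explain the shape of the plot.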
My previous graphs from a different experiment show a decrease at the end, as opposed to this one.
Why am I observing this pattern (an increase in quality scores at the end)?
Thanks for your comments.
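The per-position trend described in the question can also be checked outside FastQC, which makes it easier to compare this run with the earlier experiment. The following is a minimal sketch under a few assumptions: records are plain four-line FASTQ (no wrapped sequence lines), the encoding is Phred+33, and the function name `mean_quality_per_position` and the two example reads are invented for illustration.

```python
import io
from collections import defaultdict


def mean_quality_per_position(handle, offset=33):
    """Return the mean Phred score at each read position.

    Assumes four-line FASTQ records, so every 4th line is a quality string.
    """
    totals = defaultdict(int)
    counts = defaultdict(int)
    for line_number, line in enumerate(handle):
        if line_number % 4 != 3:
            continue  # skip header, sequence, and '+' lines
        for pos, ch in enumerate(line.rstrip("\r\n")):
            totals[pos] += ord(ch) - offset
            counts[pos] += 1
    return [totals[p] / counts[p] for p in sorted(counts)]


# Two made-up reads standing in for a real file opened with open(path).
fastq_text = "@r1\nACGT\n+\nIIII\n@r2\nACGT\n+\nII5#\n"
means = mean_quality_per_position(io.StringIO(fastq_text))
print(means)
```

Running this on files from both experiments and comparing the last few values shows whether the rise at the end is really in the data or only in how the plot is drawn.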
What is your question?
Why am I observing this pattern (an increase in quality scores at the end)?
Ever since we switched to running our samples on a HiSeq machine[*], all of our Phred distros have exhibited this exact same pattern, and I'd have to say: judging by this quality distro plot alone, your data actually looks pretty great.
[*] I'm not sure if it was the switch to the HiSeq, or the upgraded software/chemistry -- maybe GAIIx runs look like this now, too ... I wouldn't know, though.
The modeling of error rates changed in recent versions of the Illumina software. See this question for more details: http://biostar.stackexchange.com/questions/12150/rna-seq-difference-in-read-quality-pattern-between-illumina-ga-and-hiseq-2000/12179#12179
@Steve @Brad Thanks for your comments. I think the graph is fine; it's just the change in the error model by Illumina.