Question: What To Do With An Error In The Fastq Illumina Quality Scores
5
gravatar for Allpowerde
9.5 years ago by
Allpowerde1.2k
Allpowerde1.2k wrote:

I have a FASTQ* file with reads from an Illumina machine and try to do the quality control filtering with the FASTX-Toolkit but get problems with the quality scores (see this post for a nice discussion about the scores)

While fastx_quality_stats and fastx_trimmer run without complaining, 'fastq_quality_filter' is suddenly not happy with the files

fastq_quality_filter: Error: invalid quality score data on line 148 (quality_tok = "Z]aaaaa]O]aabaaaaa]" Petra*read2.fasta

The particular read looks like this :

@Petra_4_1_1_10_1327/1
AGTATTTTTGAATCTCATCATCGTCACTTCACTAAG
+Petra_4_1_1_10_1327/1
`Z]aaaaa]O]aabaaaaa`][FW`__a`\FW_X[M

Does anyone have a suggestion (other than deleting this read) or some experience?

*well it is labeled .FASTA but looks like a FASTQ file

ADD COMMENTlink modified 13 months ago by RamRS24k • written 9.5 years ago by Allpowerde1.2k
1

This may be a stupid question, but are you sure about which encoding is being used for quality scores? I'd assume the latest Illumina encoding, but can you be sure?

ADD REPLYlink written 9.5 years ago by Phis1.0k
4
gravatar for Brad Chapman
9.5 years ago by
Brad Chapman9.4k
Boston, MA
Brad Chapman9.4k wrote:

Which version of the fastx toolkit do you have installed? This seems to be working with the latest version (0.0.13):

% cat test.fastq
@Petra_4_1_1_10_1327/1
AGTATTTTTGAATCTCATCATCGTCACTTCACTAAG
+Petra_4_1_1_10_1327/1
`Z]aaaaa]O]aabaaaaa`][FW`__a`\FW_X[M
mothra:fastq % fastq_quality_filter -q 20 -p 50 -i test.fastq
@Petra_4_1_1_10_1327/1
AGTATTTTTGAATCTCATCATCGTCACTTCACTAAG
+Petra_4_1_1_10_1327/1
`Z]aaaaa]O]aabaaaaa`][FW`__a`\FW_X[M

You might have a version before support for the latest 1.3+ Illumina pipeline scores were introduced. An alternative to upgrading the fastx toolkit is to use Galaxy to convert the scores into Solexa format.

ADD COMMENTlink modified 13 months ago by RamRS24k • written 9.5 years ago by Brad Chapman9.4k
3
gravatar for Zhilong Jia
7.7 years ago by
Zhilong Jia1.5k
London
Zhilong Jia1.5k wrote:

add option '-Q 33' : ref http://seqanswers.com/forums/showthread.php?t=7399

ADD COMMENTlink written 7.7 years ago by Zhilong Jia1.5k
0
gravatar for Allpowerde
9.5 years ago by
Allpowerde1.2k
Allpowerde1.2k wrote:

It must have something to do with the wrong file endings. Since Brad demonstrated that fastx toolkit does not really have a problem with the particular read, I changed the file ending from the wrong FASTA into FASTQ and now it works like a charm.

So the above error message should have been: fastqqualityfilter: Error: invalid input for the specified file type X.fasta

Thanks Brad and PhiS for your comments !!

ADD COMMENTlink written 9.5 years ago by Allpowerde1.2k

Glad you got this working. I can't replicate your error by copying the filename to test.fasta in my example above; it still works fine. Perhaps this is an issue that recent versions take care of; it would be useful to leave a comment with the version you were having problems with for future folks who find this thread.

ADD REPLYlink written 9.5 years ago by Brad Chapman9.4k

Hmm that is interesting. I'm using the latest version and I know that it is not an issue of my OS because another machine (not sure what version it uses) returned the exact same error when presented with this file. Could it be some line ending issue that was resolved when I copied from .fasta to .fastq ?

ADD REPLYlink written 9.5 years ago by Allpowerde1.2k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2027 users visited in the last hour