Error encountered using bowtie2 on Illumina fastq file
1
0
Entering edit mode
7.0 years ago
ggman ▴ 80

Dear all,

I am trying to run bowtie 2 on my Illumina file and I receiving the following error:

Error: Encountered one or more spaces while parsing the quality string for read NB50
1124:10:H5LKGAFXX:3:21612:11073:20398 1:N:0:CTTGTN.  If this is a FASTQ file with in
teger (non-ASCII-encoded) qualities, try re-running with the --integer-quals option.
Error: Encountered exception: 'Unidentified exception'

this is a sample of my data

@NB501124:10:H5LKGAFXX:3:11401:20911:1028 1:N:0:
CTTGTANAAATCTCACATTCATAGGGCAGCTCTCCTGAATGTATCAATTCAAGACTTTTGAGATTATCAGAACGTAAGAGCAAACGTATTAAAAACACTCACCTTATTATCTATTATCCAGTATGAATATTTCATAAAAATAAGAGATTGTGGAGGG
+

Thanks for the help,

Cheers

bowtie2 Illumina fastq file • 3.1k views
ADD COMMENT
0
Entering edit mode

And how does the quality string of read 1124:10:H5LKGAFXX:3:21612:11073:20398 1:N:0:CTTGTN look like?

ADD REPLY
0
Entering edit mode

I checked the quality using Fastqc and it was good. (if this is what you are asking?)

ADD REPLY
0
Entering edit mode

I think I found part of the issue with you asked @genomax2, my output looks like this

    @HD VN:1.0  SO:unsorted
@SQ     SN:Scaffold1    LN:540667
@SQ     SN:Scaffold2    LN:419338
ADD REPLY
0
Entering edit mode

Do the (z) grep on the fastq file. Not the bam file.

ADD REPLY
0
Entering edit mode
@NB501124:10:H5LKGAFXX:3:21612:11073:20398 1:N:0:
CTTGTNCCTGAGAATGNTGAGCATAACAAGTTTTTTTAACTTGTGTTGTCCCCCATCTCTTTGTGAANCCAANGCAAANCATCCTCAAATAGTAGCCATCGGTAGTCTTTACATC
+
AAAAAEEEEE#EEEEEE

that doesn't look right...

ADD REPLY
2
Entering edit mode

As you can see it is missing Q values for a number of bases in that read. If that is the only issue then you could delete that read (and its mate) from R1/R2 files.

ADD REPLY
0
Entering edit mode

thank you for the help geno

could you post it as an answer so I could close this topic?

ADD REPLY
0
Entering edit mode

Done. Recopy your result from post above below the answer. Then I can clean this thread up.

ADD REPLY
1
Entering edit mode
7.0 years ago
GenoMax 141k

We would like to see the output of zgrep -A 3 NB501124:10:H5LKGAFXX:3:21612:11073:20398 if your data file is compressed or just grep if it is not.

ADD COMMENT

Login before adding your answer.

Traffic: 2555 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6