I have cut the 3`adapters from my RNA-Seq sequences with cutadapt 2.6 and when loading the trimmed sequences for alignment in Rsubread, program is aborted after several lines with the "ERROR: a format issue @ is found on the 393884-th line in input file".
I did not find anything about this issue in any of the manuals for either cutadapt or Rsubread. I am also quiet new to this, so don´t know if I did not just overlooked something obvious...
Would anyone know how to proceed further, please?
Output of
head -n 393884 your.fastq | tail
?Did not work for some reason, so I tried in R:
Read 1 item
[1] "@7001425F:195:CDYHMANXX:3:1102:19436:35854 1:N:0:CACTCA"
Can you show a few lines before and after than. The issue is probably somewhere there.
It seems there is missing line, for some reason:
Read 18 items
Yes, the file is corrupted as for
"@7001425F:195:CDYHMANXX:3:1102:19414:35790 1:N:0:CACTCA"
two lines are missing. Consider to use e.g.repair.sh
from BBmap to try and fix the file, discarding the corrupted part.Thank you for your help. But as this is becoming far too "black box" for me and as there is only small fraction of sequences with adapters, I´ll probably try to go for alignment without trimming. Or try different tool to cut the adapter. Is it legitimate?
Filtering all the reads (minimal length was set to 10) solved the issue!
Thank you all for your help!