'Seqtk Trimfq' Truncates Sequence Names In Fastq Files
2
2
Entering edit mode
10.8 years ago
knhai45 ▴ 20

I am processing fastq files from illumina HiSeq1500. I used 'seqtk trimfq' to trim a few first last bases out, but found that the first line of each sequence in fastq files causes a truncation - the part shown in bold/italic get deleted. Does anybody here have a clue of how I can maintain this part even after running seqtk? I will really appreciate your input.

@HWI-1KL121:57:C0M56ACXX:8:1101:1148:2123 1:N:0:AGTCAA CCAGACTTGAGAAGTTCAGCATGGATGCCACAGATCCCTGGAGCTTTGCCCCGTTTCAACTGGNNNNNCGAAGTCCGTGTTTCCCCAAGCGAAGGTGGACC + @@@DDDDDFHBAFHEHAAHG4CBHEDFBFDH)?9CAFDH933?DH?D4?FHBFFGCH@DHIEG#####--(,;=@:;;==?CA:(59(9?@80559ABBCC

illumina fastq • 3.8k views
ADD COMMENT
2
Entering edit mode
10.8 years ago
Vince Buffalo ▴ 470

Hi knhai45,

I just made the correction Heng Li's awesome toolkit. Here's the pull request but in the meantime you could use my version.

ADD COMMENT
0
Entering edit mode

nice work, thanks for the improvements!

ADD REPLY
0
Entering edit mode
10.8 years ago
knhai45 ▴ 20

Hi Vince,

Thank you very much for your quick response and the fix. It works perfectly now.

Best wishes,

knhai45

ADD COMMENT
0
Entering edit mode

Hi knhai45. Please use the "Add a new anwser to the question!" only for real answers. Use the "Comment" button below an existing answer or comment to add your comment instead. This answer will be deleted.

ADD REPLY

Login before adding your answer.

Traffic: 3001 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6