Extracting reads from FASTQ files with BBMap?

3.0 years ago

simplitia ▴ 130

Hi, I have a file with a list of read IDs that I would like to extract from paired-end reads. The list contains only 1000 read IDs. However, when I use BBMap to do this with the following command, the resulting FASTQ files are about the same size as the originals:

/bbmap/filterbyname.sh in1=${fq1} in2=${fq2} out1=strip2_R1.fq.gz out2=strip2_R2.fq.gz names=names2.txt

The new files should contain only 1000 reads, but they contain millions!

Is there something I'm doing incorrectly? Thanks!

bbmap fastq • 1.3k views

updated 3.0 years ago by GenoMax 141k • written 3.0 years ago by simplitia ▴ 130

What does names2.txt look like (show us the output of head -5 names2.txt)? Remember to remove the leading @ or > from the read names in names2.txt, and list one read name per line.
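If the list was copied straight from FASTQ or FASTA headers, something like the following could tidy it up. This is only a minimal sketch: clean_names is a hypothetical helper name, not part of BBMap, and it assumes the only problems are a leading @ or > and stray blank lines.

```shell
# Hypothetical helper (not part of BBMap): print the names file with a
# leading '@' or '>' removed from each line and blank lines dropped,
# leaving one bare read name per line.
clean_names() {
  sed -e 's/^[@>]//' -e '/^[[:space:]]*$/d' "$1"
}

# Example usage (file names are placeholders):
#   clean_names names2.txt > names2.clean.txt
#   head -5 names2.clean.txt
```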

3.0 years ago by GenoMax 141k

Hi, OK, here is what the first few lines look like.

NAME1
K00153:706:H3VMJBBXY:4:1101:12601:1279
K00153:706:H3VMJBBXY:4:1101:25875:1279
K00153:706:H3VMJBBXY:4:1101:17563:1297
K00153:706:H3VMJBBXY:4:1101:18355:1297
K00153:706:H3VMJBBXY:4:1101:28178:1297
K00153:706:H3VMJBBXY:4:1101:8501:1314
K00153:706:H3VMJBBXY:4:1101:11180:1314
K00153:706:H3VMJBBXY:4:1101:22262:1314
K00153:706:H3VMJBBXY:4:1101:22648:1314

3.0 years ago by simplitia ▴ 130

Those names should work. You may need to add substring=t, which lets a name in the list match when it is only part of the full read header.
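As a rough sketch of how that could look, here is the command from the question with substring=t added. It also sets include=t, on the assumption (please check the usage text printed by filterbyname.sh) that include defaults to f, in which case the listed reads are removed rather than kept. That would also explain output files nearly as large as the input. The function names extract_named_reads and count_reads are made up for illustration, and the read count assumes standard 4-line FASTQ records.

```shell
# Sketch only: the command from the question plus substring=t and include=t.
# Assumption: include defaults to f (listed reads are excluded), so include=t
# is needed to keep only the listed reads. Verify against the script's help.
extract_named_reads() {
  /bbmap/filterbyname.sh in1="$1" in2="$2" \
    out1=strip2_R1.fq.gz out2=strip2_R2.fq.gz \
    names=names2.txt include=t substring=t
}

# Count reads in a gzipped FASTQ file, assuming 4 lines per record.
count_reads() {
  gzip -dc "$1" | awk 'END { print NR / 4 }'
}

# Example usage:
#   extract_named_reads "${fq1}" "${fq2}"
#   count_reads strip2_R1.fq.gz   # compare with the number of names in names2.txt
```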

3.0 years ago by GenoMax 141k
