Dear All,
How can I search for a read by part of its name in a big fastq.gz file (size around 13GB)? For example, I would like to search for a read name containing the VH01677:31:AACCMFHHV:1:1101:6586:25290
string in a fastq.gz file. For a small-size fastq.gz file, I just use the gunzip command in the Mac OS terminal to unzip the file, open it with the text editor, and use the "Ctrl + F" key to search the read. But for a big-size fastq.gz file, I do not want to do it this way as it is very inefficient.
Thanks,
Xiao
Hi, rpolicastro,
Thank you very much! I tried seqkit grep and it works! It seems to me you changed the arguments from
-Irp
to just-rp
.Xiao
I'm glad it worked! Whether or not you include
-I
as an argument you'll get the same results, so I decided to edit my post and remove it just to simplify the answer.I want to print the matched read on the screen, so I use the
-Irp
argument without the-o
argument, and it works. Thanks again!Xiao