I have two paired-end fastq compressed files coming from HiSeq RNA-SEq experiment, ie., pair.1.fastq.gz and pair.2.fastq.gz.
The files are very large so I wanted to just take a few million/thousand reads from each of them (by their respective pairs) and use that file for trying/debuuging purposes.
The results should be two paired-end files, i.e., pair.test.1.fastq.gz and pair.test.2.fastq.gz.
I'd be happy to hear some suggestions on how to do this or hear about tools available, thanks!
Selecting random pairs from fastq?
thanks Pierre, I didn't realize someone else asked about it!