How to randomly subset a specific number of sequences from a large fastq file?
1
0
Entering edit mode
4.9 years ago
j.thomas93 • 0

Hi,

So I am doing some RNA seq and I have a number of large library preps for the different conditions and replicates. In some samples there a huge differences in library size and I was hoping to just do a preliminary analysis where I randomly subset x amount of reads from the parent fastq, but I want to do it multiple times from the sample fastq to generate a more representative sample. I need the sampling to therefor be random and different for each subset

Is this possible?

Thanks in advance for any help

RNA-Seq • 2.6k views
ADD COMMENT
0
Entering edit mode

Also seqkit sample by specifying --number or --proportion.

ADD REPLY

Login before adding your answer.

Traffic: 2683 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6