I want to run nanopolish and I have some large fastq files. I want to subsample them but I also need the fast5 for the subsampled reads. What is the best method to do this considering the size distributions of the ONT reads?
did you manage to find the best way to downsample your dataset?
I didn’t end up doing it but check out this thread. There is a tool that is recommended:
Login before adding your answer.
Use of this site constitutes acceptance of our User Agreement and Privacy