Hello,
I am new at processing Nanopore sequencing data and am having an issue:
I have binary fast5 files directly out of the Nanopore sequencer (R9 Flowcell) and I would like to use Dorado to perform basecalling as it seems to be the preferred tool. I have already used dorado to do simplex basecalling on pod5 files using argument "hac" for the model as it says on the nanoporetech/dorado GitHub page.
However, I can't seem to make it work on fast5 files even though the documentation says that it is supported for simplex basecalling (though less performant). This is what I type in my terminal:
$ dorado basecaller hac /directory/to/my/fast5/files --emit-fastq > output.fastq
I keep getting this error:
[error] Cannot automate model selection using fast5 files
I have tried using "fast" or "sup" instead of "hac" in case it would make a difference but to no avail.
Is there a specific model I should use or download ? Any other tools you could recommend for basecalling from fast5 files ? I know about guppy however I am unable to download it as it is an ONT tool.
Any help would be greatly appreciated.
Thanks,
Lele
Thank you for your reply. I have just tried this as well as using hac@v4.2.0 however I still get the same error... Guess I'll have to figure out another way.
Have you tried specifying exactly the model ? For instance
dna_r9.4.1_e8_hac@v3.3
for R9 flowcell.Just replaced hac by the full name of one of the downloaded model and it works. Silly mistake on my part as hac@latest still automatically chooses a model for you as it says in the model complex table.
Thanks again !