I want to download all available protozoa genomes from the NCBI database. Using ncbi-datasets download is unfortunately not an option, as it doesn't recognize "protozoa" as a valid taxon.
However, I found that the genomes are shared via FTP at the following locations: GenBank - protozoa and RefSeq - protozoa.
I tried downloading the contents using wget, but it only retrieves the directory listings; the actual genome files inside the subfolders (e.g., .fna.gz, .gbff.gz, etc.) are not downloaded, even with recursion enabled.
This is the command I used:
wget -r --continue --progress=bar:force:noscroll ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/protozoa/
How can I modify this or use another method to properly download all genome files, including those in the subdirectories?
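One possible alternative to recursion, sketched here only to show the kind of solution that would work for me: build the file URLs from the assembly_summary.txt index in the protozoa directory and pass that list to wget. This is a minimal sketch, not a tested pipeline. It assumes that column 20 of the tab-separated summary is the assembly's FTP path and that the genomic FASTA in that directory is named after the last path component plus _genomic.fna.gz. The function name summary_to_urls is made up for illustration.

```shell
#!/usr/bin/env bash
# Sketch: turn rows of an NCBI assembly_summary.txt into genomic FASTA URLs.
# Assumptions (not confirmed in this post): column 20 is the ftp_path, and the
# FASTA file is named <last path component>_genomic.fna.gz.

summary_to_urls() {
    # Reads assembly_summary.txt on stdin, prints one URL per assembly.
    awk -F '\t' '
        /^#/ { next }                      # skip header/comment lines
        $20 == "" || $20 == "na" { next }  # skip rows without a path
        {
            n = split($20, parts, "/")     # last component = assembly directory name
            print $20 "/" parts[n] "_genomic.fna.gz"
        }'
}

# Example usage (needs network access, so it is left commented out):
# wget ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/protozoa/assembly_summary.txt
# summary_to_urls < assembly_summary.txt > urls.txt
# wget --continue -i urls.txt
```

Changing the suffix in the print line (for example to _genomic.gbff.gz) would select a different file type under the same naming assumption.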
Any help or suggestions would be greatly appreciated!
P.S. I'm also interested in whether there is a way to download only microfungi genomes.