Hello,
I am new to the NCBI datasets command-line tool. I want to download a subset of all Pseudomonas genomes from NCBI Datasets. If I run the command:
datasets download genome taxon 286 --dehydrated --include genome,gff3 --filename Pseudomonas_Whole_Genera_NCBI/genomes.zip
This successfully downloads all of the Pseudomonas genomes.
However, it takes a long time and uses too much storage on my computer. Is there any way to download only a random sample of 200 of the Pseudomonas genomes?
I could list all the accession IDs manually, but is there a quick/manageable way to get them? Are there other options? Any help would be appreciated.
Best,
B
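One possible approach, sketched here rather than taken from the thread: list the accessions for the taxon, draw 200 at random, and pass that list back to the downloader. The function names below (list_accessions, sample_accessions, download_sample) are hypothetical helpers, not part of the datasets CLI. The sketch assumes a recent datasets/dataformat release and GNU shuf on the PATH, so check the flags with --help on your version.

```shell
#!/usr/bin/env bash
# Sketch only: helper names are hypothetical; verify flags with --help.

# List every assembly accession under a taxon, one per line (needs network).
list_accessions() {
  local taxon="$1"
  datasets summary genome taxon "$taxon" --as-json-lines \
    | dataformat tsv genome --fields accession --elide-header
}

# Pick n distinct accessions at random from a one-per-line file.
sample_accessions() {
  local infile="$1" n="$2" outfile="$3"
  # Drop blank lines and duplicates, then let shuf draw n lines
  # without replacement.
  grep -v '^$' "$infile" | sort -u | shuf -n "$n" > "$outfile"
}

# Download only the sampled accessions as a dehydrated package (needs network).
download_sample() {
  local accessions="$1" zipfile="$2"
  datasets download genome accession --inputfile "$accessions" \
    --dehydrated --include genome,gff3 --filename "$zipfile"
}
```

With these defined, the workflow would be roughly: list_accessions 286 > all_accessions.txt, then sample_accessions all_accessions.txt 200 sample_accessions.txt, then download_sample sample_accessions.txt Pseudomonas_sample.zip. Keeping the sampled list in a file also means you can re-download exactly the same 200 genomes later.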
Maybe make this an answer rather than a comment?
Wanted to make it a complete answer with an additional command. Moved now.
Thanks! Works great.