Hi there seniors,
I am currently new to command lines and I'm currently using this command line to download the genome of bacteria one by one, I wanted to ask if there is a faster way like since i have a list of GCF accession id to download the genome for analysis. I'm using this command:
./datasets download genome accession 'GCF_*' --include gff3,gbff,rna,cds,protein,genome,seq-report
Thanks in advance
Cheers,
Aldre
You can use
file_name_w_accession
should contain one accession per line. Use any additional options as needed.you should use the
wild cards
to download all the given above GCF files. Moverover, you can use the commands ofcurl
,wget
andslow5curl
to download the data from database.by means of that, how should I write my scripting?
say I have
btw sorry if I wasn't that clear with the replies, I just started learning command lines
Not in this situation, it is much easier to use the NCBI datasets tool.
hi there, thanks for the reply, is it possible to disclose an example of ./datasets script for that? thank you very much. i m currently trying various method as well...
here https://stackoverflow.com/a/1521498 is an example how to use read and for loop, you just need to put in your command.