I am looking for a programmatic way to download complete genomes from RefSeq for a specific taxonomy identifier, in my case I am interested in the Lactobacillales order.
- From the Genome Download FTP it is not possible to filter by the order, only genus and species.
- I tried to follow the steps described in this post. First two steps (esearch and elink) are okay, but then I do not know how to select my genomes of interest according to my criterion (complete and from RefSeq).
- I also tried the Ebot pipeline genertaor, but then again I am not sure about the query qualifiers to apply.
- My best approach so far is to use the Genome browser. From there, I can apply my filters and then download the selected records as a .csv or .txt. How can I get the same output file without using the web interface?
Thanks in advance for any hints and help!