Download Full Set of RefSeq Genomes for a Class
7.2 years ago
sbd10 • 0

Hello all.

For the past week or so I have been trying to find a way to download the full set of RefSeq genomes as FASTA files for Enterobacterales, or for Gammaproteobacteria if Enterobacterales isn't possible. I've been trying to do this through E-Utilities, with little success. My most recent attempt to gather a list of FTP URLs was:

esearch -db assembly -query "Enterobacterales[organism] AND assembly_nuccore_refseq[filter]" |
esummary |
xtract -pattern DocumentSummary -element FtpPath

This returns far more results than I can feasibly sort through, when I really only expected about 120 hits based on: https://www.ncbi.nlm.nih.gov/genome/browse/reference/#
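For context on what I plan to do with the list once it is narrowed down: each FtpPath is a directory, not a file. Below is a minimal sketch of how I would turn those directories into FASTA URLs. It assumes the usual NCBI layout, where the genomic FASTA is named after the last path component plus `_genomic.fna.gz`. The function names `fasta_url` and `ftp_paths_to_fasta_urls` are my own placeholders, not part of E-Utilities.

```shell
# Hypothetical helper: map one assembly FTP directory to its genomic FASTA URL.
# Assumption: the file lives at <dir>/<last path component>_genomic.fna.gz
fasta_url() {
  local dir="${1%/}"        # drop a trailing slash, if any
  local base="${dir##*/}"   # last path component, e.g. GCF_000005845.2_ASM584v2
  printf '%s/%s_genomic.fna.gz\n' "$dir" "$base"
}

# Read FtpPath lines on stdin, skip blank ones, print one FASTA URL per line.
ftp_paths_to_fasta_urls() {
  local path
  while IFS= read -r path; do
    if [ -n "$path" ]; then
      fasta_url "$path"
    fi
  done
}
```

The output of the xtract step could be piped into `ftp_paths_to_fasta_urls > urls.txt`, and the resulting file handed to `wget -i urls.txt`. That part seems straightforward; the problem is getting the right set of assemblies in the first place.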

I'd prefer to remain in Unix and use E-Utilities if possible. Thanks for any advice/proposed modifications.

NCBI E-utilities • 1.4k views