How to get reference genome sequence given species name
4.1 years ago
dp ▴ 50

Hi, I have a list of bacterial species and would like to programmatically download all of their reference genomes. The list contains only the species names, e.g.:

Actinomyces_viscosus
Corynebacterium_matruchotii
 :
 :
etc etc

Is there an easy way to do this?

Thanks!

genome reference sequences • 1.1k views
4.1 years ago
svp ▴ 680

If you are using conda

conda install -c bioconda ncbi-genome-download

or simply with pip:

pip install ncbi-genome-download

You can then use the following loop in a bash script to download the genomes:

# Species names; underscores are converted to spaces before querying NCBI
list=("Actinomyces_viscosus" "Corynebacterium_matruchotii")
for i in "${list[@]}"; do
    ncbi-genome-download --refseq-category reference --genus "${i//_/ }" bacteria
done
4.1 years ago
GenoMax 141k

Use the ncbi-genome-download tool from Kai Blin.

ncbi-genome-download --refseq-category reference --genus "Actinomyces viscosus" bacteria

Run this in a loop over your list, substituting each species name (with the underscore replaced by a space) for the quoted name.
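As a rough sketch of such a loop, the snippet below reads species names from a file, one per line, and builds one command per species. The file name species.txt, the DRY_RUN switch, and the function names are hypothetical, made up for illustration. It assumes the tool matches the --genus value against the NCBI organism name, and that your installed version accepts the flag spellings used in this thread; newer releases may spell them differently, so check ncbi-genome-download --help first. Not every species necessarily has an assembly in the "reference" category.

```shell
#!/bin/sh
# Sketch only: function names, DRY_RUN and species.txt are illustrative assumptions.

# Convert "Genus_species" to "Genus species".
to_query() {
    printf '%s\n' "$1" | tr '_' ' '
}

# Print the command for one species (DRY_RUN=1, the default) or run it (DRY_RUN=0).
fetch_species() {
    query=$(to_query "$1")
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf 'ncbi-genome-download --refseq-category reference --genus "%s" bacteria\n' "$query"
    else
        # stdin is redirected so the tool cannot consume the species list
        ncbi-genome-download --refseq-category reference --genus "$query" bacteria < /dev/null
    fi
}

# Process every non-empty line of the file given as the first argument.
fetch_all() {
    while IFS= read -r name || [ -n "$name" ]; do
        [ -n "$name" ] || continue
        fetch_species "$name"
    done < "$1"
}
```

After sourcing the snippet, calling fetch_all species.txt only prints the commands so they can be inspected; DRY_RUN=0 fetch_all species.txt would actually run the downloads.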
