Question: Get all completely sequenced genomes from one genus
4 months ago, bird77 wrote:

Is there an automatic way to get the fasta sequences of all sequenced (preferably completely) genomes within a taxonomic group?

And how can I get the taxid for all of these organisms as well?

Thank you.

genome • 163 views

As far as I know, Ensembl has no dedicated API for this. If you are specifically interested in bacteria from Ensembl Genomes, here is a hackish script you can adapt.

Comment written 4 months ago by kloetzl
4 months ago, tdmurphy wrote:

This is easily accomplished from NCBI's Assembly resource: You can download FASTA, annotation, or other files using the big blue "Download Assemblies" button.

Note that "complete genome" is a useful filter for bacteria, but only a handful of eukaryote assemblies are sequenced to completion (mostly fungi). If you're interested in eukaryotes, you may want to either focus on assemblies at the "chromosome" level (to exclude WGS assemblies that are just bags of scaffolds), or use the "exclude partial" filter to drop the small number of assemblies that cover only a subset of the genome (e.g. just one chromosome).
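
If you would rather script this than click through the web page, here is a rough Python sketch that asks NCBI E-utilities (esearch, then esummary, on db=assembly) for the same kind of list and also pulls the taxid for each assembly, which covers the second part of the question. Several things in it are assumptions rather than anything confirmed above: the search term syntax (`[Organism]`, `"complete genome"[filter]`, `latest[filter]`), the lowercase JSON field names (`assemblyaccession`, `taxid`, `speciesname`, `ftppath_refseq`, `ftppath_genbank`), and the `<directory name>_genomic.fna.gz` naming of the FASTA file. The function names are made up for illustration. Check the output against the Assembly web page before relying on it.

```python
import json
import urllib.parse
import urllib.request

EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def build_term(genus, level="complete genome"):
    # Assumed Assembly query syntax; swap level for "chromosome" for eukaryotes.
    return f'"{genus}"[Organism] AND "{level}"[filter] AND latest[filter]'


def fasta_url(ftp_path):
    # Assumption: the genomic FASTA sits in the assembly directory and is
    # named after that directory, with a "_genomic.fna.gz" suffix.
    base = ftp_path.rstrip("/")
    name = base.rsplit("/", 1)[-1]
    return f"{base}/{name}_genomic.fna.gz"


def parse_summaries(result):
    # "result" is the "result" object of an esummary JSON reply:
    # a "uids" list plus one record per uid.
    rows = []
    for uid in result.get("uids", []):
        doc = result[uid]
        ftp = doc.get("ftppath_refseq") or doc.get("ftppath_genbank") or ""
        rows.append({
            "accession": doc.get("assemblyaccession", ""),
            "taxid": str(doc.get("taxid", "")),
            "species": doc.get("speciesname", ""),
            "fasta_url": fasta_url(ftp) if ftp else None,
        })
    return rows


def fetch_json(endpoint, params):
    # Plain GET request; returns the decoded JSON reply.
    url = f"{EUTILS}/{endpoint}?{urllib.parse.urlencode(params)}"
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def list_assemblies(genus, level="complete genome", retmax=100):
    # Step 1: find assembly UIDs for the genus at the requested level.
    search = fetch_json("esearch.fcgi", {
        "db": "assembly",
        "term": build_term(genus, level),
        "retmax": retmax,
        "retmode": "json",
    })
    ids = search["esearchresult"]["idlist"]
    if not ids:
        return []
    # Step 2: fetch the document summaries (accession, taxid, FTP path).
    summary = fetch_json("esummary.fcgi", {
        "db": "assembly",
        "id": ",".join(ids),
        "retmode": "json",
    })
    return parse_summaries(summary["result"])
```

Calling `list_assemblies("Escherichia")` would need network access and returns at most `retmax` records, so a large genus needs paging or a higher limit. Each returned row carries the taxid next to a candidate FASTA URL that you can hand to any downloader.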
