Question: Get all completely sequenced genomes from one genus
29 days ago, bird77 wrote:

Is there an automatic way to get the FASTA sequences of all sequenced (preferably completely sequenced) genomes within a taxonomic group?

And how can I get the taxid for all of these organisms as well?

Thank you.


For Ensembl, there is no dedicated API for this that I know of. If you are specifically interested in bacteria from Ensembl Genomes, here is a hackish script you can adapt.

Comment written 29 days ago by kloetzl860
28 days ago, tdmurphy wrote:

This is easily accomplished with NCBI's Assembly resource: search for the genus, apply the filters you need, then download FASTA, annotation, or other files using the big blue "Download Assemblies" button.
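If you would rather script the search than click through the web page, the same Assembly database can be queried through NCBI's E-utilities. The snippet below is only a minimal sketch that builds the `esearch` URL. The function name `build_assembly_search_url` is made up for illustration, and the `[Organism]` / `[filter]` search terms are my assumption about the query syntax, so check them against a search in the web interface before relying on them.

```python
from urllib.parse import urlencode

ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_assembly_search_url(genus, complete_only=True, retmax=500):
    """Return an esearch URL that lists Assembly records for one genus.

    Hypothetical helper: it only builds the URL and does not contact NCBI.
    """
    # Assumed search syntax: [Organism] restricts by taxon, and [filter]
    # mirrors the filters offered in the Assembly web interface.
    term = f"{genus}[Organism] AND latest[filter]"
    if complete_only:
        term += ' AND "complete genome"[filter]'
    params = {
        "db": "assembly",
        "term": term,
        "retmax": retmax,
        "retmode": "json",
    }
    return f"{ESEARCH_URL}?{urlencode(params)}"
```

Fetching that URL (for example with `urllib.request.urlopen`) should return JSON whose `esearchresult.idlist` holds the Assembly UIDs. Those UIDs can then be passed to `esummary.fcgi` with `db=assembly` to look up the taxid and download path of each record.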

Note that "complete genome" is a useful filter for bacteria, but only a handful of eukaryote assemblies (mostly fungi) are sequenced to completion. If you're interested in eukaryotes, you may want to either focus on assemblies at the "chromosome" level (to exclude WGS assemblies that are just bags of scaffolds) or use the "exclude partial" filter to drop the small number of assemblies that cover only a subset of the genome (e.g. just one chromosome).
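The same filters can be applied offline to the tab-separated `assembly_summary.txt` files that NCBI publishes on its genomes FTP site, which also gives you the taxid of each assembly. What follows is a rough sketch, not an official tool: `select_assemblies` is a hypothetical helper, the column positions are my reading of the file layout (verify them against the `#` header line of your copy), and matching the genus on the first word of the organism name is a crude shortcut that will miss names such as "Candidatus ...".

```python
import csv
import io

# Assumed 0-based column positions in assembly_summary.txt.
COL_ACCESSION = 0
COL_TAXID = 5
COL_ORGANISM = 7
COL_LEVEL = 11       # e.g. "Complete Genome", "Chromosome", "Scaffold", "Contig"
COL_GENOME_REP = 13  # "Full" or "Partial"
COL_FTP_PATH = 19


def select_assemblies(summary_text, genus,
                      levels=("Complete Genome",), full_only=True):
    """Yield (accession, taxid, organism, fasta_url) for matching rows."""
    reader = csv.reader(io.StringIO(summary_text), delimiter="\t",
                        quoting=csv.QUOTE_NONE)
    for row in reader:
        # Skip blank lines, "#" header lines, and truncated rows.
        if not row or row[0].startswith("#") or len(row) <= COL_FTP_PATH:
            continue
        organism = row[COL_ORGANISM]
        # Crude genus match on the first word of the organism name.
        if organism.split(" ")[0] != genus:
            continue
        if row[COL_LEVEL] not in levels:
            continue
        # Equivalent of the "exclude partial" filter.
        if full_only and row[COL_GENOME_REP] != "Full":
            continue
        ftp_path = row[COL_FTP_PATH].rstrip("/")
        if ftp_path in ("", "na"):
            continue
        # The genomic FASTA is named after the last path component.
        basename = ftp_path.rsplit("/", 1)[-1]
        yield (row[COL_ACCESSION], row[COL_TAXID], organism,
               f"{ftp_path}/{basename}_genomic.fna.gz")
```

For bacteria you would call it with the default `levels`. For eukaryotes, something like `levels=("Complete Genome", "Chromosome")` keeps chromosome-level assemblies while still dropping partial ones.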
