Summarizing list of Genbank Assembly Accessions
1
0
Entering edit mode
3.1 years ago
fheller • 0

I have a list of genbank accession ID numbers resulting from an experiment. I'm interested in summarizing the strains to which they belong, eg by grouping them at a genus or species level. Does anyone know of a tool me to do this? Initially I thought I could use something like phyloT (https://phylot.biobyte.de), but my IDs are in the wrong format. Any suggestions as to tools I might look into or search terms I might use to explore further myself would be greatly appreciated.

Examples of the sort of Genbank Assembly Accessions I have: GCA_013300845.1 GCA_014847545.1 GCA_004167785.1

assembly gene genome alignment • 492 views
ADD COMMENT
0
Entering edit mode
3.1 years ago
GenoMax 141k

Using EntrezDirect:

$ more id
GCA_013300845.1
GCA_014847545.1
GCA_004167785.1

$ epost -db assembly -format acc -input id | esummary | xtract -pattern DocumentSummary -element Genbank,Organism,Sub_type,Sub_value
GCA_014847545.1 Ligilactobacillus ruminis (firmicutes)
GCA_013300845.1 Blautia glucerasea (firmicutes) strain  MSK.20.42
GCA_004167785.1 Bacteroides ovatus (CFB group bacteria) strain  am_0171
ADD COMMENT

Login before adding your answer.

Traffic: 2154 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6