Question: How to get a strain name if I know only an assembly ID?
gravatar for little_more
9 months ago by
little_more10 wrote:

Suppose I have a tree with assembly IDs of different E.coli strains in it (all from NCBI). Is there any common way to replace each assembly ID with the corresponding strain name? I tried using BioPython (efetch) but it raises an error.

assembly • 308 views
ADD COMMENTlink modified 9 months ago by vkkodali1.1k • written 9 months ago by little_more10

Always post sample ID's when asking a question like this. Posting your code/actual error is also beneficial if you don't want an alternate answer.

ADD REPLYlink modified 9 months ago • written 9 months ago by genomax70k
gravatar for vkkodali
9 months ago by
United States
vkkodali1.1k wrote:

You can try something like this using Entrez Direct (

esearch -db assembly -q '854998' | esummary | xtract -pattern DocumentSummary -def "NA" -element Taxid Genbank RefSeq Organism

You can replace the id 854998 (which is a RefSeq assembly ID) with either assembly UID, Genbank assembly ID, or an NCBI assembly accession.

ADD COMMENTlink written 9 months ago by vkkodali1.1k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1529 users visited in the last hour