convert GCF code to outer leaf Organism name
1
0
Entering edit mode
4.5 years ago

Hi all!

I have a phylogenetic tree with GCF numbers as outer leaves. I would like to convert these numbers into species names. I already tried to solve this with ete3 within Python or Archaeopteryx to download this information from NCBI. This did not work out. Would be great if someone knows the solution.

Tree-format is newick.

phylogeny archaeopteryx figtree ete3 • 903 views
ADD COMMENT
1
Entering edit mode
4.5 years ago
GenoMax 141k

Are these GCF NCBI assembly accession numbers? If so, using EntrezDirect:

$ esearch -db assembly -query "GCF_000008865" | efetch -format docsum | xtract -pattern DocumentSummary -element AssemblyAccession,Organism
GCF_000008865.2 Escherichia coli O157:H7 str. Sakai (E. coli)
GCF_000008865.1 Escherichia coli O157:H7 str. Sakai (E. coli)
ADD COMMENT
0
Entering edit mode

I'm sorry! GCF_001598035

ADD REPLY
1
Entering edit mode

Here you go:

$ esearch -db assembly -query "GCF_001598035" | efetch -format docsum | xtract -pattern DocumentSummary -element AssemblyAccession,Organism
GCF_001598035.1 Streptococcus halotolerans (firmicutes)

Remove AssemblyAccession from command above if you just want the name.

ADD REPLY

Login before adding your answer.

Traffic: 2652 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6