Get Taxonomy/Organism information from GenBank Ids
8.0 years ago

Hi BioStars,

Does anyone have suggestions for a fast way to retrieve multiple GenBank records from a list of GenBank IDs/accessions? I am interested in getting the Taxonomy and/or Organism fields for a large set of queries. They are BLAST hits for environmental RNA, so they could come from a variety of sources. I have a large number of runs (around 20 x 500 IDs), so submitting them on the website is not really an option.

I tried the Entrez ESearch utility, but it is pretty dang slow to submit queries and download results, plus I don't want to hammer the NCBI servers any more than I need to.

I have the space to download GenBank if anyone knows of a way to format the files into a form that can be queried quickly.

Any help or tips would be appreciated.

genbank sequence blast • 3.6k views
8.0 years ago
natasha.sernova ★ 4.0k

Actually, this question has already been asked here:

Fetching Genbank Entries For List Of Accession Numbers.

The answers there include some nice Biopython scripts.

More than one approach is described, so read through them carefully.
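
For reference, here is a minimal sketch of the batched approach using only the Python standard library and the NCBI ESummary endpoint, rather than the Biopython scripts from the linked thread. It is not taken from that thread. The helper names (`chunked`, `build_request`, `parse_summaries`, `fetch_taxonomy`), the batch size, the pause, and the placeholder email are all assumptions for illustration, and the JSON field names (`uids`, `accessionversion`, `taxid`, `organism`) reflect my understanding of the `nuccore` summary format, so check them against a real response before relying on them.

```python
import json
import time
import urllib.parse
import urllib.request

# NCBI E-utilities ESummary endpoint.
EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esummary.fcgi"


def chunked(ids, size=200):
    """Yield successive batches of at most `size` IDs."""
    for start in range(0, len(ids), size):
        yield ids[start:start + size]


def build_request(batch, email="you@example.org"):
    """Return (url, POST body) for one ESummary call on the nuccore database.

    The email is a placeholder; replace it with your own address.
    """
    params = {
        "db": "nuccore",
        "id": ",".join(batch),
        "retmode": "json",
        "email": email,
    }
    return EUTILS, urllib.parse.urlencode(params).encode("ascii")


def parse_summaries(payload):
    """Map accession -> (taxid, organism) from an ESummary JSON payload.

    Field names are assumed; verify them against an actual response.
    """
    result = payload.get("result", {})
    out = {}
    for uid in result.get("uids", []):
        doc = result[uid]
        key = doc.get("accessionversion", uid)
        out[key] = (doc.get("taxid"), doc.get("organism"))
    return out


def fetch_taxonomy(ids, batch_size=200, pause=0.4):
    """Fetch taxonomy for all IDs, one POST request per batch."""
    taxonomy = {}
    for batch in chunked(ids, batch_size):
        url, body = build_request(batch)
        with urllib.request.urlopen(url, data=body) as handle:
            taxonomy.update(parse_summaries(json.load(handle)))
        time.sleep(pause)  # pause between requests to go easy on NCBI
    return taxonomy
```

Calling `fetch_taxonomy(list_of_accessions)` should return a dictionary keyed by accession. Sending a few hundred IDs per request means a run of 500 IDs needs only a handful of calls, which is much gentler on the NCBI servers than one request per ID.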


Thanks Natasha,

I had tried to find previous answers but I guess I missed this one.


Here is its URL:

Fetching Genbank Entries For List Of Accession Numbers.

I've spoilt it a little bit.

*tps://www.biostars.org/p/66921/

Look at the post, it's helpful!
