Is it possible to pull Genbank ID from RefSeq?
1
0
Entering edit mode
4.2 years ago
anabaena ▴ 10

Hey everyone, I have a list of RefSeq protein ID's and need to pull the GenkBank/INSDC data. When I pull the attached data to the RefSeq WP_XXXXX either through esummary, or elink I get no connection to anything else not even the WGS data, so I was wondering if it was even possible to do so in Biopython Entrez?

Entrez Biopython • 843 views
ADD COMMENT
2
Entering edit mode
4.2 years ago
GenoMax 141k

With Entrezdirect you could do something like this:

$ esearch -db protein -query "WP_060189833.1" | elink -target nuccore | efetch -format acc
NZ_LPHR01000079.1
NZ_LPKX01000035.1
NZ_LPBZ01000030.1
NZ_MEBG01000045.1

These are GenBank ID's but they refer to genome sequences.

ADD COMMENT
0
Entering edit mode

Awesome thanks! I am assuming these commands can be translated into Biopython? as I need to iterate over a large list?

ADD REPLY
1
Entering edit mode

Yes. Should be possible.

ADD REPLY

Login before adding your answer.

Traffic: 2975 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6