7.1 years ago
matthew.wipperman ▴ 10
I am interested in writing a script to convert Integrated Microbial Genomes (IMG) identifiers to a different type of identifier, such as NCBI accession numbers (e.g., the YP_ identifiers for proteins). I have not been able to find a way to match the IMG identifiers to their corresponding genes/proteins. Any advice would be most welcome.
Have you had any success yet?
You could do this via BLAST if you have the sequences.
Could you explain how?
Download the sequences for the relevant IMG identifiers. BLAST each of these sequences against the NCBI database of your choice (protein, nuccore, nr, etc.). You can then match each IMG identifier to an NCBI accession through the sequence itself: an identical (or near-identical) hit gives you the corresponding accession.
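As a rough sketch of the matching step, assuming you have already run BLAST with tabular output (`-outfmt 6`, whose default columns are qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart, send, evalue, bitscore) and that the query IDs in your FASTA file are the IMG identifiers. The function name `best_hits`, the `min_identity` threshold, and the IDs in the demo are made up for illustration; this is one possible approach, not a definitive pipeline.

```python
import csv
import io


def best_hits(blast_tsv, min_identity=100.0):
    """Map each query ID to the subject ID of its highest-bitscore hit.

    blast_tsv: an open file (or any iterable of lines) in BLAST tabular
    format (-outfmt 6, default 12 columns).
    min_identity: hits below this percent identity are ignored
    (an assumed cutoff; loosen it if exact matches are not expected).
    """
    best = {}  # query ID -> (subject ID, bitscore)
    for row in csv.reader(blast_tsv, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        query, subject = row[0], row[1]
        identity, bitscore = float(row[2]), float(row[11])
        if identity < min_identity:
            continue
        if query not in best or bitscore > best[query][1]:
            best[query] = (subject, bitscore)
    return {query: subject for query, (subject, _) in best.items()}


if __name__ == "__main__":
    # Hypothetical BLAST output; the IMG IDs and accessions are invented.
    demo = io.StringIO(
        "637000001\tYP_000001.1\t100.000\t250\t0\t0\t1\t250\t1\t250\t1e-180\t500\n"
        "637000001\tYP_000002.1\t98.000\t250\t5\t0\t1\t250\t1\t250\t1e-170\t480\n"
    )
    for img_id, accession in best_hits(demo).items():
        print(img_id, accession, sep="\t")
```

In practice you would pass an open file, e.g. `best_hits(open("hits.tsv"))`, where `hits.tsv` is a placeholder name for your BLAST output. Note that the percent identity alone does not guarantee a full-length match, so you may also want to compare the alignment length against the query length before trusting a mapping.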