I have a set of proteins from diverse Bacteria. I'm trying to analyze them using DAVID but it cannot recognize more than 80% of my GI numbers. Here you have one example:
gi|386865221|ref|YP_006278169.1| membrane-anchored cell surface protein [Burkholderia pseudomallei 1026b]
So, the GI numbers is:
386865221
But DAVID (with identifier as PROTEIN_GI_ACCESSION) cannot recognize it. I also tried with NCBI to find the GI number but, there is nothing. Finally, I did a protein blast with the original protein sequence (see below) and I obtained a 100% match protein:
AFI69771.1 membrane-anchored cell surface protein [Burkholderia pseudomallei 1026b]
Strikingly, I tried to make an ID conversion with biodbnet but, again, I didn't found anything.
My question is, is there any easy way to relate my GI and AFI identifiers?
gi|386865221|ref|YP_006278169.1| membrane-anchored cell surface protein [Burkholderia pseudomallei 1026b] MNKIYKTIWCETTRSWVAVSEHANGKRGGATAAATTSARPIWTRLRGISLAALAAFGLGLFASPAAFAQS NSVMCANYNNGILPTYTGYGASPSLTSPCTTGIGSWAGGVTPGSTTNWIGLSADDTQIVLNGSTGNIYFR AGGTNGNTLTMSNVAGSGPTGGVLLSGVAAGAVTATSSQAINGSQLYSLSTSASTGIGSLSSSMSTFNSS ISSLSTGLSSTNSGLTSLSTSASTGLSSANSSIASLSSGLSSTNSSLTSLSTSASSGISTAQSGVNSLST GLSTTNSTVASLSTSTSTGIGSLSTGLSSTNSSLTSLSTSASSGISSANSSVASLSTSTSTGIGSLSTGL ISTNSSLTSLSTSASSGISTAQSGVNSLSTGLSTTNSTVASLSTSTSTGIGSLSTGLSSTNSSLTSLSTS ASSGISSANSSVASLSTSTSTGIGSLSTGLSSTNSSLTSLSTSASSGISSANSSVASLSTSTSTGIGSLS TGLSSTNSSLTSLSTSASSGISTAQSGVNSLSTGLSTTNSTVASLSTSTSTGIGSLSTGLSSTNSSLTSL STSTSSGISSANSSIASLSTSTSTGISSLSTGLSTTDSNLASLSTSTSTGLSSATSSIASLSTSTSTGIG SLSTGLSTTNSNLTSLSTSTSTGLSSANSSITSLSSGLSTTNSNVASLSSGLSSTNSSLTSLSTSTSSGI SSANSSVASLSTSTSTGIGSLSTGLSTTNSNLTSLSTSTSTGLSSANSSITSLSSGLSTTNSNVASLSTG LSSTNSSLTSLSTSTSSGISSANSSIASLSTSTSTGISSLSTGLSTTDSNLTSLSTSTSTGLSSANSSIT SLSSGLSTTNSNVASLSSGLSATNSSLTSLSTSASSGISSANSSVASLSTSTSTGIGSLSTGLSTTNSNL TSLSTSTSTSLSSANSSITSLSSGLSTTNSNVASLSTGLSSTNSSLTSLSTSASSGISSANSSVASLSTS TSTGLSSANSSITSLSSGLSTTNSNVASLSSGLSTTNSSLTSLSTSASSGISSANSSIASLSTSTSTGIS SLSTGLSTTDSNLASLSTSTSTGLSSAASSIASLSTSTSTGISSLSTGLSTTDSNLTSLSTSTSTGLSSA NSSITSLSSGLSTTNSNVASLSSGLSTTNSSVASLSTGLSTTDSNLASLSTSTSTGLSSTTSSIASLSTS TSTSVDSLSTGLSTTNSSVASLSTGLSTTDSSLASLSTSTSTGLSSTTSSIASLSTSTSTSFSSALSSIG SLSTGLATTNSNLASLSTSTLTSVSSLSTGLSATNSSVASLSTSASTGLAATNSTVASLSTSTSTAVGSL STGLSTTNSNVASLSTSTSTAVGSLSTSLSTTNSNVASLSTSTSTSVNSLSTGLSTTNTSVASLSTSVTN LNTQLTSLSTTIVNSTNNVIRALPASTGIAADMSAPNAAAPSVTAGSNSVALGANSTDGGRSNVVSVGSA TQQRQITNVAAGTEGTDAVNVNQLNALSTSMSQSLAGQQGQINNLGSQLTQTQQALQQTDTMARQGIAAA TALTMLPQVEPGKTINVAVGVARFAGQSGMAFGASAHVTTNGILKLGIGVSGQNKTFGAGYGYSW
Try the suggestions at:
https://ncbiinsights.ncbi.nlm.nih.gov/2016/12/06/converting-gi-numbers-to-accession-version/
and
https://ncbiinsights.ncbi.nlm.nih.gov/2016/12/23/converting-lots-of-gi-numbers-to-accession-version/