I have a list of 500 proteins with their sequences, GI numbers, source organisms, and GenBank IDs. I want to create a pie chart of the taxonomic phyla these proteins belong to. How can I proceed?
So far, my plan is to use ID mapping on the GI numbers/GenBank IDs to get taxonomy IDs, and then use those taxonomy IDs to get the list of phyla. However, I have not found an option in the UniProt browser for mapping GI numbers/GenBank IDs to taxonomy IDs. The NCBI Taxonomy browser does have an option to enter a list of organisms and get their taxonomy IDs. But then how do I get the list of phyla? In the final output, I'm expecting something like the following:
Phylum            Counts
Proteobacteria    300
Acodonacteria     100
Cyanobacteria     100
--------------------------
Total             500
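To make the target concrete, here is a minimal sketch of the counting step I have in mind. It assumes I already have, for each protein, a taxonomic lineage as a list of entries with a rank and a scientific name (the `Rank` and `ScientificName` keys mirror the lineage records that NCBI taxonomy lookups return, but that layout is an assumption here). The protein names, the example data, and the helper functions `phylum_of`, `phylum_counts`, and `format_table` are all hypothetical.

```python
from collections import Counter


def phylum_of(lineage):
    """Return the name of the entry ranked 'phylum', or 'Unclassified' if none."""
    for node in lineage:
        if node.get("Rank") == "phylum":
            return node["ScientificName"]
    return "Unclassified"


def phylum_counts(lineages_by_protein):
    """Count how many proteins fall into each phylum."""
    return Counter(phylum_of(lineage) for lineage in lineages_by_protein.values())


def format_table(counts):
    """Render the counts as a plain-text table with a total row."""
    lines = [f"{'Phylum':<20}{'Counts':>6}"]
    for name, n in counts.most_common():
        lines.append(f"{name:<20}{n:>6}")
    lines.append("-" * 26)
    lines.append(f"{'Total':<20}{sum(counts.values()):>6}")
    return "\n".join(lines)


# Hypothetical example data: protein identifier -> lineage entries.
example_lineages = {
    "protein_1": [
        {"Rank": "phylum", "ScientificName": "Proteobacteria"},
        {"Rank": "class", "ScientificName": "Gammaproteobacteria"},
    ],
    "protein_2": [
        {"Rank": "phylum", "ScientificName": "Proteobacteria"},
        {"Rank": "class", "ScientificName": "Alphaproteobacteria"},
    ],
    "protein_3": [
        {"Rank": "phylum", "ScientificName": "Cyanobacteria"},
    ],
    "protein_4": [
        {"Rank": "class", "ScientificName": "Some class without a phylum entry"},
    ],
}

counts = phylum_counts(example_lineages)
print(format_table(counts))
```

The resulting counts could then be passed to a plotting library to draw the pie chart, for example matplotlib's `pyplot.pie` with the phylum names as labels. What I am still missing is how to obtain the lineage for each GI number/GenBank ID in the first place.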