Hello everyone,
I would like to run blastp with a few query sequences against a few hundred genomes, using the nr database.
My progress thus far involves:
- I know there is the NCBI website for blastp. However, I am not sure how to provide input for many genomes. I know the taxid, gi and acc for my genomes, but I would rather not type the taxid for each genome into the input bar (for the nr database).
- I found out I can use the BLAST+ suite on my Ubuntu machine, but again I am not sure how to use it. I found some tutorials on creating my own databases, but since my laptop has little computational power I would rather send my files to the NCBI server. How can I do this properly?
Thanks a lot in advance!
You could write a script to send your queries to the NCBI server. BioPerl has Bio::Tools::Run::RemoteBlast for this, or you can roll your own using the NCBI BLAST URL API. Just be careful not to hammer their server, or you may get banned from it.
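If you go the roll-your-own route, here is a rough Python sketch of the idea, using only the standard library. It assumes the BLAST URL API works as I understand it: submit with CMD=Put, read the RID and estimated wait (RTOE) from the reply, poll with CMD=Get and FORMAT_OBJECT=SearchInfo, then fetch the report. The email address, tool name, taxids, file names and all function names below are placeholders I made up, so adapt them and check the parameters against the current NCBI documentation before relying on this.

```python
import re
import time
import urllib.parse
import urllib.request

BLAST_URL = "https://blast.ncbi.nlm.nih.gov/Blast.cgi"


def build_put_params(fasta, taxid, email="you@example.org"):
    """Parameters for submitting one blastp search against nr, limited to one taxid."""
    return {
        "CMD": "Put",
        "PROGRAM": "blastp",
        "DATABASE": "nr",
        "QUERY": fasta,
        "ENTREZ_QUERY": f"txid{taxid}[ORGN]",  # restricts nr to this organism
        "EMAIL": email,                         # placeholder, use your own address
        "TOOL": "my_blast_script",              # placeholder tool name
    }


def parse_put_response(text):
    """Extract the request ID and the estimated wait in seconds from the Put reply."""
    rid = re.search(r"^\s*RID = (\S+)", text, re.M)
    rtoe = re.search(r"^\s*RTOE = (\d+)", text, re.M)
    if rid is None:
        raise ValueError("no RID found in server response")
    return rid.group(1), int(rtoe.group(1)) if rtoe else 60


def parse_status(text):
    """Return WAITING, READY, FAILED or UNKNOWN from a SearchInfo reply."""
    match = re.search(r"Status=(\w+)", text)
    return match.group(1) if match else "UNKNOWN"


def _post(params):
    """Send one POST request to the BLAST server and return the body as text."""
    data = urllib.parse.urlencode(params).encode()
    with urllib.request.urlopen(BLAST_URL, data=data) as resp:
        return resp.read().decode()


def blast_one(fasta, taxid):
    """Submit one search, wait politely until it is done, return the text report."""
    rid, rtoe = parse_put_response(_post(build_put_params(fasta, taxid)))
    time.sleep(max(rtoe, 10))
    while True:
        status = parse_status(
            _post({"CMD": "Get", "FORMAT_OBJECT": "SearchInfo", "RID": rid})
        )
        if status == "READY":
            break
        if status in ("FAILED", "UNKNOWN"):
            raise RuntimeError(f"search {rid} ended with status {status}")
        time.sleep(60)  # poll a given RID no more than about once a minute
    return _post({"CMD": "Get", "FORMAT_TYPE": "Text", "RID": rid})


if __name__ == "__main__":
    with open("queries.fasta") as handle:  # hypothetical input file
        queries = handle.read()
    for taxid in [562, 1280]:              # example taxids, replace with yours
        report = blast_one(queries, taxid)
        with open(f"blastp_txid{taxid}.txt", "w") as out:
            out.write(report)
        time.sleep(10)                     # pause between submissions
```

The searches run one after another with pauses in between on purpose. With a few hundred genomes this will take a while, but it keeps the load on the server low, which is the point of the warning above.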