How to blastp
1
0
Entering edit mode
8.8 years ago

Hello everyone,

I am looking to perform blastp with a few sequences on a few hundred genomes using the nr database.

My progress thus far involves:

  1. I know there is the NCBI website for blastp. However, I am not sure how I can make the input for many genomes. I know the taxid, gi and acc for my genomes but I would not rather type the taxid for each of my genomes into the input bar (for the nr database).
  2. I found out I can use the blast+ suite on my Ubuntu. Again, I am not sure how to use it. I found some tutorials to create my own databases, but since I have low computational power on my laptop I would rather like to send my files to the NCBI server. So how can I do properly?

Thanks a lot in advance!

genome blast • 3.1k views
ADD COMMENT
0
Entering edit mode

You could write a script to send your queries to the NCBI server. BioPerl has Bio::Tools::Run::RemoteBlast for this or you can roll your own using the NCBI Blast API. Just be careful not to hammer their server or you may get banned from it.

ADD REPLY
1
Entering edit mode
8.8 years ago
dago ★ 2.8k

If you want to perform a blastp search for the protein encoded in you genome, you can use the .faa of your genomes as -query in blasp and select nr as -db. To perform search on the NCBI server you can select the flag -remote.

See blast option here.

ADD COMMENT
0
Entering edit mode

What is the difference between the genomes in my .faa and the nr database? Shouldnt first download the genome from the nr database? And my sequence is from a separate genome and has nothing to do with the ones I am trying to blast.

ADD REPLY

Login before adding your answer.

Traffic: 1697 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6