Question: How do I blast against a specific portion of the nr database?
phoenixmx (United Kingdom) wrote, 5.3 years ago:

I would like to blast my sequences against the swissprot database, using local blast. For the purpose of downstream applications, I would like to do this using the copy of the swissprot database from the NCBI's ftp site, not the version from swissprot's own website.

The NCBI version comes as tiny .msk and .pal files, accompanied by this message:

"Certain databases are subsets of a larger parental database. For those databases, alias and mask files, rather than actual databases, are provided. The mask file needs the parent database to function properly. The parent databases should be generated on the same day as the mask file. For example, to use swissprot pre-formatted database, swissprot.tar.gz, one will need to get the nr.tar.gz with the same date stamp."

I have both the nr database and swissprot files, with matching date stamps, downloaded and extracted. How do I now blast against the swissprot database? If I try to blast against swissprot as if it were a normal database, it tells me it cannot find 'swissprot.pin'. I have also gone through the help files from my blast executable (blastall 2.2.18) and see no parameters for mask or alias files.

Cross-posted: http://seqanswers.com/forums/showthread.php?t=43245

(comment by Pierre Lindenbaum, 5.3 years ago)
Carlos Borroto (Washington Metropolitan Area) wrote, 5.3 years ago:

It sounds like you need a newer version of BLAST. I just confirmed I can use the .msk and .pal files for swissprot with 'Protein-Protein BLAST 2.2.28+'.
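To see which BLAST you actually have on the server, something like the following should do. This is only a rough sketch: the function name `which_blast` is made up for illustration, and it simply assumes that BLAST+ ships `blastp` (which prints its version with `-version`) while the older package ships `blastall`.

```shell
# Hypothetical helper (the name is illustrative): report which BLAST flavour is on the PATH.
which_blast() {
    if command -v blastp >/dev/null 2>&1; then
        # BLAST+ programs print their version string with -version
        blastp -version
    elif command -v blastall >/dev/null 2>&1; then
        echo "only legacy blastall found; BLAST+ is a separate package"
    else
        echo "no BLAST executables found on PATH"
    fi
}

which_blast
```

If this reports only the legacy `blastall`, installing a recent BLAST+ alongside it is probably the easiest next step.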

 


That might indeed be the case, thank you. The blast installation I have access to on the server I'm using is rather old.

Would you mind explaining exactly what the correct way to use those files is? Do I just set the database parameter to swissprot, or do I set the database to nr and include extra parameters pointing to the mask or alias files?

Not having done this before, I don't know what the correct approach is.

 

EDIT: I've got the most recent BLAST+ executable on the server now, and it works just fine if you set -db as swissprot and let it figure things out itself.
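For anyone finding this later, the invocation looks roughly like the sketch below. The names are placeholders I picked for illustration (`run_swissprot_blast`, `query.fasta`, `swissprot_hits.tsv`, `/path/to/blastdb`), and it assumes the swissprot alias/mask files and the nr files were extracted into the same directory. `BLASTDB` is the environment variable BLAST+ consults when looking for databases, and `-outfmt 6` asks for tabular output.

```shell
# Hypothetical wrapper: BLAST a protein query against the swissprot alias database.
# $1 = query FASTA, $2 = output file, $3 = directory holding swissprot.pal and the nr files
run_swissprot_blast() {
    query=$1
    out=$2
    dbdir=$3
    # Name the alias database only; BLAST+ resolves the parent nr database itself.
    BLASTDB=$dbdir blastp -db swissprot -query "$query" -out "$out" -outfmt 6
}

# Run only when BLAST+ is installed and the placeholder query file exists.
if command -v blastp >/dev/null 2>&1 && [ -f query.fasta ]; then
    run_swissprot_blast query.fasta swissprot_hits.tsv /path/to/blastdb
fi
```

Running `blastdbcmd -db swissprot -info` first should print a short summary of the database if the alias resolves correctly, which is a cheap sanity check before a long search.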

(reply by phoenixmx, written and later edited 5.3 years ago)