Question: What'S The Easiest Way To Blast 5000 Sequences Against An Exon Fragment?
gravatar for John
8.4 years ago by
John770 wrote:

I have 5000 EST sequences that I downloaded from Genbank. I have a 500 bp exon fragment that I'd like to BLAST against this lot of sequences to see if my exon is present or not. What's the easiest way to do this?

I thought that there might be a way to do this through the NCBI directly without even having to download the sequences (e.g. an option to 'BLAST these results') but I can't find one. My next option was to download BLAST, setup a local database, import the 5000 sequences and then BLAST my exon against that.

Is there an easier option?

Thanks, John

est blast • 2.4k views
ADD COMMENTlink written 8.4 years ago by John770

yes, they are the same species.

ADD REPLYlink written 8.4 years ago by John770

how did you selected these 5000 ESTs in the first place? Same/close species as your exon fragment?

ADD REPLYlink written 8.4 years ago by Darked894.2k

You can use to make it easier to set up the custom blast database and run searches...

ADD REPLYlink written 7.7 years ago by Yannick Wurm2.3k
gravatar for Jarretinha
8.4 years ago by
São Paulo, Brazil
Jarretinha3.3k wrote:

In this case, I wouldn't use BLAST. Exonerate can be much superior in this kind of task and don't need a database setup. As suggested by Pierre, it can do exhaustive search efficiently and have a lot of models used during alignment (e. g. protein to DNA, EST to genome). It's fast, flexible with a plethora of options. The only incovenient is its output. If you want to try it, learn how to use the --ryo option.

ADD COMMENTlink written 8.4 years ago by Jarretinha3.3k

+1, I once used exonerate for a similar problem (mapping ESTs to mRNAs).

ADD REPLYlink written 8.4 years ago by Giovanni M Dall'Olio26k
gravatar for Pierre Lindenbaum
8.4 years ago by
France/Nantes/Institut du Thorax - INSERM UMR1087
Pierre Lindenbaum122k wrote:

(I would download the executable version of blast). But are you sure you want to use Blast ? 5000 is not so big, why not running a smith-waterman alignment ?

ADD COMMENTlink written 8.4 years ago by Pierre Lindenbaum122k
gravatar for Larry_Parnell
8.4 years ago by
Boston, MA USA
Larry_Parnell16k wrote:

In general, it is far more efficient to run the one exon sequence against the 5000 ESTs than to run 5000 searches against a single sequence. The smaller number of seqs comprises your query set and the larger number your database, in general, and especially for BLAST.

Both answers by Jarretinha and Pierre are good. This does seem like an important set of ESTs to you and so you're likely to use them again in other searches. You wouldn't want to always run the 5000 against the newest idea or exon you have on hand.

ADD COMMENTlink written 8.4 years ago by Larry_Parnell16k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1133 users visited in the last hour