Question: Best Tool For Aligning Short Sequence Against Genome
gravatar for BruceB
6.8 years ago by
Cambridge, UK
BruceB330 wrote:

I have a short sequence (34bp) that I would like to align against the mouse genome. Probably a bit of an odd question, so let me explain in more detail...

At the moment, I can predict regions where this sequence could be present (based on experimental data from our lab). So I take the reference genome from these regions, use ClustalW2 and align my 34bp sequence. It aligns where I expected it to. The alignment is poor: there are mismatches and gaps but this is to be expected as the purpose of this is to troubleshoot a problem in our targeted resequencing.

Now I've exhausted the regions we know/think this sequence occurs in and would like to generate a list of other positions where this sequence could also be found. The alignment doesn't need to be perfect, I'm after an indication of where these sequences are found.

Of course, ClustalW2 isn't good for this alignment as the reference is simply too large. What I'm looking for is a tool that can performed the gapped and mismatched alignment I'm getting from ClustalW2 but across the whole genome.

Is there such a tool and does anyone have any experiences with doing something similar?

short sequence alignment • 6.7k views
ADD COMMENTlink modified 6.8 years ago by Prakki Rama2.4k • written 6.8 years ago by BruceB330
gravatar for jcastrofigueroa
6.8 years ago by
Norwich, UK
jcastrofigueroa140 wrote:

Hello: As NicoBxl suggested you can do local blast on your computer. A handy way for getting results when you blast a very short sequences (like yours) against a long database of DNA is using the parameter "-task blastn-short" or/and making the "-evalue" very high (10 or more). At least these work fine for me. I hope I helped.

ADD COMMENTlink modified 6.8 years ago • written 6.8 years ago by jcastrofigueroa140
gravatar for Prakki Rama
6.8 years ago by
Prakki Rama2.4k
Prakki Rama2.4k wrote:

You can use BLAT as well. The standalone is available here

ADD COMMENTlink modified 6.8 years ago • written 6.8 years ago by Prakki Rama2.4k
gravatar for Nicolas Rosewick
6.8 years ago by
Belgium, Brussels
Nicolas Rosewick9.0k wrote:

Try a good old Blast :

ADD COMMENTlink written 6.8 years ago by Nicolas Rosewick9.0k

Wonderful suggestion. I was using the default parameters which weren't dealing with the mismatches or gaps very well. A bit of tweaking has given me a list of positions that almost perfectly predict the regions where said sequencing problem has occurred. A little more tweaking and I should have a really good list of potentially problematic regions.

ADD REPLYlink written 6.8 years ago by BruceB330
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1520 users visited in the last hour