Question: Can NUCmer align a short sequence with a much longer sequence (local alignment)?
0
gravatar for nchuang
4.1 years ago by
nchuang230
United States
nchuang230 wrote:

Hey guys,

I wrote a script in python to call nucmer to align a query sequence with a 10k nucleotide sequence. The query sequence size varies from 20 bases to as large as 2000 bases. It seems like nucmer will find a match with the larger sized queries but it can't with smaller. Is this just not what nucmer/mummer is designed to do and I should just stick with using Smith-Waterman?

Thanks!

nucmer local alignment mummer • 1.2k views
ADD COMMENTlink modified 4.1 years ago • written 4.1 years ago by nchuang230

I would try MAFFT7.

http://mafft.cbrc.jp/alignment/software/algorithms/algorithms.html

" Updated! (2015/Jun) Parameters for E-INS-i have been changed in version 7.243. The new parameters work better for aligning a set of long sequences and short sequences that are closely related to each other. To disable this change, add the --oldgenafpair option.

With the new parameters, E-INS-i may be able to align multiple cDNAs and multiple genomic sequences of a gene from closely related species. However, it consumes large memory space when the sequences are long. "

ADD REPLYlink modified 4.1 years ago • written 4.1 years ago by natasha.sernova3.7k

oh I guess that's another option. I was hoping to keep all the code I wrote for NUCmer reading/parsing. I liked how Mummer spits out a coordinate file.

ADD REPLYlink written 4.1 years ago by nchuang230
1
gravatar for nchuang
4.1 years ago by
nchuang230
United States
nchuang230 wrote:

I figured it out!

you have to set the minimum cluster size -c to as small as the query you are using. The default is 65.

ADD COMMENTlink written 4.1 years ago by nchuang230
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 890 users visited in the last hour