Question

Exonerate: Issues please help

0

Entering edit mode

6.3 years ago

carlosalfonsogonzalez6 ▴ 10

Hello Everyone,

Im currently facing some problems with exonerate(and tried version 2.2 and 2.4), using the next code:

exonerate --model est2genome --softmasktarget yes --bestn 1 --minintron 20 --maxintron 20000 --bigseq yes --query Long_isoform_CDNA.fasta --showvulgar no --target DvirgenomeMasked.fa.out --showalignment no --showtargetgff yes --verbose 0 > AnnoDvir_clean_percent.gff

Persons who have used this program, How do you overcome the slowness problem? I have also see that when i do not used the "--bestn" command wich gives the best aligned secuence the code runs perfeclty.

Hope you can help me with that Thanks Greetings

alignment rna-seq • 2.8k views

ADD COMMENT • link updated 6.3 years ago by Michael 54k • written 6.3 years ago by carlosalfonsogonzalez6 ▴ 10

0

Entering edit mode

How did you install the versions of Exonerate you used?

ADD REPLY • link 6.3 years ago by h.mon 35k

0

Entering edit mode

Following the user guide, ./configure, make, make installl, Do you have an idea about making the programing runnning faster?

ADD REPLY • link 6.3 years ago by carlosalfonsogonzalez6 ▴ 10

0

Entering edit mode

Hi I havent find an answer yet

ADD REPLY • link 6.3 years ago by carlosalfonsogonzalez6 ▴ 10

0

Entering edit mode

Did you check that you're not running out of memory ? --bestn is supposed to speed up exonerate. However, you're using quite a high value for it which may make exonerate work more than needed. Since --model est2genome uses essentially a Smith-Waterman alignment algorithm, --bestn 70 probably forces exonerate to run this for quite a lot of sequences. If the problem is due to the --bestn option then you may be better off not using it and then post processing the output. Also look at the section on 'Strategies for speed' in the man pages.

ADD REPLY • link 6.3 years ago by Jean-Karim Heriche 27k

0

Entering edit mode

Oh, Thank you very much for you answer i use --bestn 1 i dont know what i put it that way on my question, Does you answer still the same? Im trying to map cDNA from Drosophila virilis to the Drosophila melanogaster genome, this with the goal to obtain an anotation for both species... for rna seq analyses. Im open to suggestions.

Thanks!!

ADD REPLY • link 6.3 years ago by carlosalfonsogonzalez6 ▴ 10

score 0 · Answer 1 · 2018-01-03

The debian man-pagel also lists a --cores parameter. You can also try setting the --score parameter and setting --fsmmemory to be close to physical RAM.

Using GMAP could be an alternative.

From the manual:

STRATEGIES FOR SPEED Keep all data on local disks.

Apply the highest acceptable score thresholds using a combination of --score, --percent and --bestn.

Repeat mask and dust the genomic (target) sequence. (Softmask these sequences and use --softmasktarget).

Increase the --fsmmemory option to allow more query multiplexing.

Increase the value for --seedrepeat

When using an alignment model containing introns, set --geneseed as high as possible.

If you are compiling exonerate yourself, see the README file supplied with the source code for details of compile-time optimisations.