Question: Limiting memory usage aligner MAFFT uses
0
gravatar for namarino
13 days ago by
namarino0
namarino0 wrote:

Hello all! I'm using MAFFT to do a L-INS-i alignment of about 35,000 sequences. I'm running MAFFT in the Linux command line, but I'm running into the issue that it is using basically all of the memory on our server. I've tried using the -m command to try to allot memory, but it didn't work. Does anyone have any suggestions?

alignment • 102 views
ADD COMMENTlink modified 13 days ago by Brice Sarver2.8k • written 13 days ago by namarino0
1
gravatar for h.mon
13 days ago by
h.mon27k
Brazil
h.mon27k wrote:

L-INS-i is not the recommended method for large alignments:

About

MAFFT is a multiple sequence alignment program for unix-like operating systems. It offers a range of multiple alignment methods, L-INS-i (accurate; for alignment of <∼200 sequences), FFT-NS-2 (fast; for alignment of <∼30,000 sequences), etc.

Also check the Tips for handling a large dataset page.

ADD COMMENTlink written 13 days ago by h.mon27k
1
gravatar for Brice Sarver
13 days ago by
Brice Sarver2.8k
United States
Brice Sarver2.8k wrote:

My understanding is that this can't be done without modifying the source code. If you have evidence to the contrary, I (and others, I'm sure) would like to see.

The current build from Bioconda w/options:

MAFFT v7.407 (2018/Jul/23)
https://mafft.cbrc.jp/alignment/software/
MBE 30:772-780 (2013), NAR 30:3059-3066 (2002)

High speed:
  % mafft in > out
  % mafft --retree 1 in > out (fast)

High accuracy (for <~200 sequences x <~2,000 aa/nt):
  % mafft --maxiterate 1000 --localpair  in > out (% linsi in > out is also ok)
  % mafft --maxiterate 1000 --genafpair  in > out (% einsi in > out)
  % mafft --maxiterate 1000 --globalpair in > out (% ginsi in > out)

If unsure which option to use:
  % mafft --auto in > out

--op # :         Gap opening penalty, default: 1.53
--ep # :         Offset (works like gap extension penalty), default: 0.0
--maxiterate # : Maximum number of iterative refinement, default: 0
--clustalout :   Output: clustal format, default: fasta
--reorder :      Outorder: aligned, default: input order
--quiet :        Do not report progress
--thread # :     Number of threads (if unsure, --thread -1)
ADD COMMENTlink written 13 days ago by Brice Sarver2.8k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1097 users visited in the last hour