Question: What Is The Fastest Alignment Tool To Map Reads To A Big Genome Over 4Gb?
3
gravatar for Dejian
6.7 years ago by
Dejian1.2k
United States
Dejian1.2k wrote:

We assembled a genome around 5GB. Now we are trying to map the Illumina reads to the genome. There are many aligners that can do this job. But which is the fastest one? Have anyone benchmarked any of these tools?

The list of aligners is as follows: short-read aligners.

short aligner • 2.9k views
ADD COMMENTlink modified 3 months ago by predeus540 • written 6.7 years ago by Dejian1.2k
6
gravatar for Istvan Albert
6.7 years ago by
Istvan Albert ♦♦ 77k
University Park, USA
Istvan Albert ♦♦ 77k wrote:

Asking for the fastest tool is posing the question the wrong way.

What you need is finding a fast enough method for your data. As Jeremy points out there are many questions that need to be answered and you will need to run your own tests to identify which tool is fast enough and provides the necessary coverage as well. Your genome is likely to contain vast streches of low complexity regions - that could make some approaches perform differently.

Start with the known fast tools like bowtie and SOAP and see what these do. Also read the literature: Comparative analysis of algorithms for next-generation sequencing read alignment

ADD COMMENTlink modified 6.7 years ago • written 6.7 years ago by Istvan Albert ♦♦ 77k
3
gravatar for Ido Tamir
6.7 years ago by
Ido Tamir4.8k
Austria
Ido Tamir4.8k wrote:

There are some comparisons the latest to my knowledge is Ruffalo. But is being the fastest really an important criteria, if the difference to more accurate results is 2x running time?

ADD COMMENTlink written 6.7 years ago by Ido Tamir4.8k

Currently we want to get a rough result of the coverage, so speed is the first consideration. Thanks for the paper recommended.

ADD REPLYlink written 6.7 years ago by Dejian1.2k
3
gravatar for Jeremy Leipzig
6.7 years ago by
Philadelphia, PA
Jeremy Leipzig17k wrote:

you need a lot more criteria to make this a fair fight
here are some just off the top of my head

  1. number of mismatches
  2. memory ceiling
  3. paired-end?
  4. quality-aware?
  5. sensitivity
ADD COMMENTlink written 6.7 years ago by Jeremy Leipzig17k

Currently we want to get a rough result of the coverage, so speed is the first consideration. Thanks for your suggestion.

ADD REPLYlink written 6.7 years ago by Dejian1.2k

your comment indicated that you do not understand that accuracy and sensitivity are not independent from coverage. Increasing these will necessarily slow down the method.

ADD REPLYlink written 6.7 years ago by Istvan Albert ♦♦ 77k
1
gravatar for predeus
3 months ago by
predeus540
Russia
predeus540 wrote:

I think it would be interesting to bump it up in 2018.

What is the fastest aligners out there? I have heard (I think Heng Li wrote it somewhere) that with changes in computer architecture, hash-based aligners are coming to be more and more interesting, compared to FM-index based. I am about to try our SNAP, which promises up to 20x increase compared to bowtie2 and bwa.

Like Istvan rightly pointed out, aligner has to be fast for your application. But sometimes speed could be a limiting factor - e.g. when you are teaching a class and want to show something pretty quickly, and not wait for 30 minutes.

ADD COMMENTlink written 3 months ago by predeus540
0
gravatar for Gjain
6.7 years ago by
Gjain5.2k
Göttingen, Germany
Gjain5.2k wrote:

I use Novoalign (http://www.novocraft.com/main/page.php?s=novoalign) which is good and fast.

ADD COMMENTlink written 6.7 years ago by Gjain5.2k
1

is novoalign really fast? I've found it to do a good job, but not especially fast.

ADD REPLYlink written 6.7 years ago by brentp22k

it depends on your definition of what you call fast. If I run my mapping (HG19) pipeline on our cluster with around 800 nodes ... it normally takes 4-5 hours.

ADD REPLYlink written 6.7 years ago by Gjain5.2k

@Gjain: Do you use MPI version? or ...

ADD REPLYlink written 6.5 years ago by Rm7.7k

just normal version

ADD REPLYlink written 6.5 years ago by Gjain5.2k

Hi,I‘m using a software named LSC,which need novoalign to align.But I can’t download novoalign.And I can't open tihs URL.Can you send it to me? My email address is liuyanhu005@gmail.com. Many thanks!

ADD REPLYlink written 5.5 years ago by liuyanhu0050
3

Hi, My apologies for reviving an old thead but I've receently become aware that the default options in LSC for Novoalign are far from ideal.

If you are using Novoalign with LSC you need to use -r Exhaustive not -r All. This will improve results dramatically and should be better than the other aligners.

For 100bp reads try.. novoalign_options = -c1 -r Ex 1000 -t 120 -F FA -g 0 -x 20 -o sam

If read length is different adjust -t accordingly. On Cerebellum reads used by Kin Fai results are 40% better with 30% reduced run time vs BWA using LSC 0.3.1

Best, Colin

ADD REPLYlink written 4.5 years ago by Sparks60

This is very useful ... thanks Colin

ADD REPLYlink written 4.5 years ago by Gjain5.2k

hi, please use this page to review the licence agreement and you can download the one which suits your needs best. http://www.novocraft.com/main/page.php?id=968

ADD REPLYlink written 5.5 years ago by Gjain5.2k

Thank you a lot!

ADD REPLYlink written 5.5 years ago by liuyanhu0050
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 928 users visited in the last hour