Question

Fastest (non-pseudo) aligner for RNA-seq illumina seq data (year 2019)

0

Entering edit mode

5.1 years ago

enxxx23 ▴ 280

What is the fastest (non-pseudo) aligner for RNA-seq illumina seq data today in year 2019?

RNA-Seq • 2.7k views

ADD COMMENT • link updated 5.1 years ago by Kristoffer Vitting-Seerup ★ 4.0k • written 5.1 years ago by enxxx23 ▴ 280

1

Entering edit mode

fastest is a relative metric unless you do an apples-to-apples comparison on hardware you will eventually use. Choice of alignment options can have a significant impact on speed. It would be difficult to get those to align well among aligners.

bbmap.sh from BBMap suite will stand with the best of them on any given day.

~~bwa mem may be the smallest memory footprint aligner (~6-7G for human genome).~~

ADD REPLY • link 5.1 years ago by GenoMax 141k

0

Entering edit mode

BWA MEM is not a RNA-seq aligner by design.

ADD REPLY • link 5.1 years ago by enxxx23 ▴ 280

0

Entering edit mode

Sure. While the statement is true if you are looking for a splice-aware aligner it is not applicable in this case. Though you could use it if you were dealing with bacterial RNAseq data.

ADD REPLY • link 5.1 years ago by GenoMax 141k

0

Entering edit mode

True, I should have mentioned the target organisms, which in this case are eukaryotes. Retroviruses do not have RNA so doing RNA-seq on retroviruses is the only choice! ;-)

ADD REPLY • link 5.1 years ago by enxxx23 ▴ 280

0

Entering edit mode

what do you mean by

Retroviruses do not have RNA

ADD REPLY • link 5.1 years ago by Nicolas Rosewick 11k

0

Entering edit mode

I don't believe that speed only is a valuable concept. I could write an EXTREMELY fast aligner (with terrible accuracy).

ADD REPLY • link 5.1 years ago by WouterDeCoster 47k

1

Entering edit mode

Challenge accepted !

ADD REPLY • link 5.1 years ago by Nicolas Rosewick 11k

3

Entering edit mode

for read in SeqIO.parse("reads.fastq.gz"):
    pass

ADD REPLY • link 5.1 years ago by WouterDeCoster 47k

0

Entering edit mode

ok great. Now submit to Nature Methods

ADD REPLY • link 5.1 years ago by Nicolas Rosewick 11k

0

Entering edit mode

They'd accept it, for sure.

ADD REPLY • link 5.1 years ago by Kevin Blighe 87k

0

Entering edit mode

At last try to :

read.seq == reference

ADD REPLY • link 5.1 years ago by Bastien Hervé 5.3k

0

Entering edit mode

if read in reference: print('Aligned!')

ADD REPLY • link 5.1 years ago by ddeemer ▴ 10

0

Entering edit mode

STAR is for me, the best compromise between running time and efficiency

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5792058/

ADD REPLY • link 5.1 years ago by Bastien Hervé 5.3k

1

Entering edit mode

I think STAR and HiSAT are probably similar. STAR requires less tuning, but HiSAT can be tuned to give very similar accuracy performance. The benefit of HiSAT is that it uses much less memory.

ADD REPLY • link 5.1 years ago by i.sudbery 19k

score 1 · Answer 1 · 2019-03-04

1

Entering edit mode

5.1 years ago

Kristoffer Vitting-Seerup ★ 4.0k

The better choices are:

Both are very fast and highly accurate. Unless you have large differences in GC content there is no better tool - then they perform very similar. If you have a high GC-content Salmon is probably the better option due to its gcBias algorithm.

ADD COMMENT • link 5.1 years ago by Kristoffer Vitting-Seerup ★ 4.0k

0

Entering edit mode

OP explicitly mentioned being looking for aligners, not pseudo-aligners...

ADD REPLY • link 5.1 years ago by WouterDeCoster 47k

0

Entering edit mode

Woops - Missed the -non :D

ADD REPLY • link 5.1 years ago by Kristoffer Vitting-Seerup ★ 4.0k