Question

Star Or Tophat?

15

Entering edit mode

10.3 years ago

lkmklsmn ▴ 970

Hi,

I am analyzing RNA seq experiment and I would like to hear what you guys think about the STAR and Tophat alignment programs. Which one do you prefer? Why? Pros and Cons of both of them.

rnaseq alignment tophat • 33k views

ADD COMMENT • link updated 3.0 years ago by Ram 43k • written 10.3 years ago by lkmklsmn ▴ 970

1

Entering edit mode

After 7 years I would say STAR

ADD REPLY • link 3.3 years ago by DareDevil ★ 4.3k

Ram · Answer 1 · 2014-01-17

33

Entering edit mode

10.3 years ago

Devon Ryan 104k

STAR is better in most ways, from mapping accuracy to speed. The big caveat to STAR is that you need a good bit of RAM. For a nice objective look at STAR and other RNAseq aligners, I would recommend that you have a quick read through this recent and very thorough comparison from the RNA-seq Genome Annotation Assessment Project in Nature Methods (there's a similar comparison by the same collaboration for transcript reconstruction in the same issue).

BTW, the take-home message from that paper can probably be summed up from Figure 3 (the paper is open access, so this is a direct link) Mapping accuracy comparison from Engström et al. 2013

Edit: Have a look at IV's answer as well. I hadn't mentioned Gsnap, but I can also say that it's always produced very good results if you have an annotation (this seems to be confirmed in the review that I linked to).

ADD COMMENT • link updated 3.0 years ago by Ram 43k • written 10.3 years ago by Devon Ryan 104k

0

Entering edit mode

TopHat2 (especially with annotations) looks quite good to me based on just that figure. I'll have to re-read the paper to remember what "partly correctly mapped means" and whether that could cause problems.

ADD REPLY • link updated 3.0 years ago by Ram 43k • written 10.3 years ago by brentp 24k

1

Entering edit mode

Yeah, tophat2 is still a pretty good all around option. The biggest downside is how long it takes to run.

ADD REPLY • link updated 3.0 years ago by Ram 43k • written 10.3 years ago by Devon Ryan 104k

3

Entering edit mode

On our architecture STAR can map 60 million reads in about 4 mins. We have had tophat 2 take about 2-6 days on the same data.

ADD REPLY • link updated 4.3 years ago by Ram 43k • written 10.0 years ago by Alastair Kerr 5.3k

Ram · Answer 2 · 2014-01-18

To my opinion, some of the most important pros and cons:

Tophat

Pros

Widely used + huge community to ask questions in fora
No fuss connections with cufflinks and any other Tuxedo pipeline tool
A great part of published results are based on this aligner and is widely accepted
Provides a ready to use junction file

Cons

Really slow response rates from the relevant helpdesk email
Doesn't do read clipping for partial read alignment (which is really useful in many scenarios)
Inner mate distance and sd have to be calculated beforehand for optimal performance

Star

Pros

Super fast
The latest versions get really good statistics in comparisons
Can do read clipping
It has a mode of output compatible with cufflinks
Provides a ready to use junction file

Cons

The first versions had many issues.
Not so many users as other aligners but I think that there is a strong user base, especially after ENCODE
I'll add in the list also GSNAP, which we also widely use in the lab

GSNAP

Pros

Always one of the best in any comparison article out there (usually 1st or 2nd)
Can handle partial matches with clipping and indels
Not that resource intensive
The creator really supports the aligner and answers really fast in emails
Gets splicing junctions really well
Has a mode compatible with cufflinks
Provides multiple sam output (concordant, halfmapping, paired, halfmapping, unique, multimapping, etc)

Cons

The last version (with the suffix array) is a lot faster than any previous version but still slower than Star [unless it's run within the ultrafast algorithm max allowed mismatches]
Not so many users as Tophat (even though you can get also really good feedback from Trinity users)

Ram · Answer 3 · 2014-01-17

2

Entering edit mode

10.3 years ago

Ming Tommy Tang ★ 3.9k

STAR is much faster than Tophat

ADD COMMENT • link updated 3.0 years ago by Ram 43k • written 10.3 years ago by Ming Tommy Tang ★ 3.9k

Ram · Answer 4 · 2014-01-17

1

Entering edit mode

10.3 years ago

swbarnes2 14k

TopHat is more widely used, and if you need help with it, there are a lot more users who can help. (see how many people use the TopHat tag over the STAR tag)

ADD COMMENT • link updated 3.0 years ago by Ram 43k • written 10.3 years ago by swbarnes2 14k

Ram · Answer 5 · 2014-04-30

1

Entering edit mode

10.0 years ago

super ▴ 60

STAR is much faster than Tophat, but I don't know which result is more reliable. But I think both are OK

ADD COMMENT • link updated 3.0 years ago by Ram 43k • written 10.0 years ago by super ▴ 60