Question: Is There Any Other Method For Isoform-Level Differential Expression Analysis Except Cufflinks/Cuffdiff?
8
gravatar for Huanwei Wang
6.1 years ago by
Huanwei Wang260
Shanghai, China
Huanwei Wang260 wrote:

Hello, all!

I'm a beginner for RNA-seq data analysis.

As far as I know, there are many tools for gene expression(such as cufflinks/HTSeq-Count/even bedtools for RPKM) and differential expression analysis(CuffDiff/DESeq/EdgeR) of RNA-seq. However, for isoform-level analysis, is there any other method except cufflinks/cuffdiff? And which method or pipeline do you think is the best?

Thank you!

cufflinks cuffdiff rna-seq • 15k views
ADD COMMENTlink modified 4.9 years ago by Charles Warden6.5k • written 6.1 years ago by Huanwei Wang260
8
gravatar for Charles Warden
4.9 years ago by
Charles Warden6.5k
Duarte, CA
Charles Warden6.5k wrote:

I have personally not been very impressed with the transcript-level qualifications, especially with RNA-Seq data with very uneven coverage (which is almost all the data I have directly worked with, although I've heard there are newer protocols to assist with the problem of uneven coverage).  For this reason, I would tend to stick towards analyzing differential splicing events (exon skipping, intron retention, etc.) with MATS, MISO, etc. over whole transcript quantification (and I tend to use gene-level mRNA quantification rather than transcript-level mRNA quantification)

That said, you could take the transcript abundances from cufflinks, RSEM, etc. and treat them like a normal, gene-level differential expression experiment (using limma, sRAP, etc.).  I don't think this a great solution, but I don't know if it is really much worse than using cuffdiff.

To get an idea about the robustness of gene-level vs. transcript-level differential expression would look like, you can see Figure 5 in the following paper (although it may not be a completely fair comparison because the gene-level and transcript-level expression will often be correlated, and discrepancies between transcript abundances shouldn't be expected for all genes):

http://bioinfo.aizeonpublishers.net/content/2013/6/bioinfo285-292.pdf

 

 

ADD COMMENTlink modified 4.9 years ago • written 4.9 years ago by Charles Warden6.5k

HI Charles,

Could you explain me the difference between gene-level mRNA quantification and transcript-level mRNA quantification

ADD REPLYlink written 4.0 years ago by Varun Gupta1.1k

The precise difference may vary depending what tool you use, but gene-level quantification will give you a single expression value per gene (associated with a gene symbol, most likely).  In contrast, transcript-level quantification will give you expression values for each isoform of a gene, and this can only be calculated when using a tool that assigns reads across isoforms for a given gene (often associated with a RefSeq accession number, but that would obviously depend on what reference you used to define your genes).

I would define the gene-level quantification as the sum of reads assigned to exons present in any of the isoforms for a gene, but I'm not sure if this is always the case.

ADD REPLYlink modified 4.0 years ago • written 4.0 years ago by Charles Warden6.5k
4
gravatar for Ryan Dale
6.1 years ago by
Ryan Dale4.8k
Bethesda, MD
Ryan Dale4.8k wrote:

Have a look at MISO (paper). The documentation is quite good, too.

ADD COMMENTlink written 6.1 years ago by Ryan Dale4.8k
1

That document is quite impressive

ADD REPLYlink written 6.1 years ago by Puriney330
3
gravatar for Mikael Huss
6.1 years ago by
Mikael Huss4.6k
Stockholm
Mikael Huss4.6k wrote:

Yes, there is EBSeq, which is often coupled to RSEM as the isoform quantifier. The EBSeq paper was just published.

EDIT: I forgot about ALEXA-seq, which is a good choice for human and mouse RNA-seq.

Then there is the possibility to obtain isoform-level read counts, which could be done by CuffDiff, RSEM or eXpress (although the latter works by mapping to a reference transcriptome, so it is a bit different from the others), and to then use DESeq/edgeR/limma/SAMSeq ... (insert favorite DE tool). However, I have heard it argued that this is not correct because the DE tools build on assumptions about read counts that apply to genes but not isoforms. For example, the EBSeq paper abstract puts it in this way:

When isoform DE is of interest, investigators often apply gene-level (count-based) methods directly to estimates of isoform counts. Doing so is not recommended. In short, estimating isoform expression is relatively straightforward for some groups of isoforms, but more challenging for others. This results in estimation uncertainty that varies across isoform groups. Count-based methods were not designed to accommodate this varying uncertainty and consequently application of them for isoform inference results in reduced power for some classes of isoforms and increased false discoveries for others.

ADD COMMENTlink modified 6.1 years ago • written 6.1 years ago by Mikael Huss4.6k
1

I believe the newer DEXSeq (from the DESeq people) will no do isoforms because it evaluates differential expression at the level of exons and not genes.

ADD REPLYlink written 6.1 years ago by Dan Gaston7.1k

Can you compare the tools you mentoined with cufflink/cuffdiff?Which is more easy to use, more exact?

ADD REPLYlink written 6.1 years ago by Huanwei Wang260

It's hard to compare. I find Cufflinks/Cuffdiff straightforward to use but that could be because I am more used to them. I have run into some practical issues running RSEM but ultimately got it, and EBSeq, to work fine. However, it is very hard to say which is more accurate because of course I do not have any ground truth for the experiment I was looking at. There is a good comparison of differential expression methods for RNA-seq (http://arxiv.org/abs/1301.5277) but it only looks at the gene level (which of course is a good first step), not the isoform level. It would be cool with an isoform-level version of that paper.

ADD REPLYlink written 6.1 years ago by Mikael Huss4.6k

Hi Mikael,

Have you used EBSeq? I have paired/matched samples, and I need a tool that takes this factor into consideration. I have used the latter alternative you mention: estimating isoform expression with CuffDiff and then doing a paired test in edgeR.

However I would prefer to use a tool that does both expression estimate and DE. Do you know if EBSeq supports paired samples?

Thanks,

Maria

ADD REPLYlink written 3.6 years ago by mjg20
1

I don't think it supports paired samples (please correct me if I am wrong, someone). Right now I am waiting for Sleuth (https://groups.google.com/forum/#!topic/kallisto-sleuth-users/WxWLcmE1Eeo) which goes together with the Kallisto transcript-estimation software and which will include functionality for specifying designs (such as paired samples). That would allow for isoform-level expression estimation  together with factorial design aware DE analysis. You *could* also try Ballgown, which starts from Cufflinks assemblies, estimates expression levels of contigs (i e transcripts) which you can then do paired DE analysis on. However, if you have a well annotated genome you will probably lose some sensitivity in this way.

ADD REPLYlink written 3.6 years ago by Mikael Huss4.6k
2
gravatar for Ann
4.9 years ago by
Ann2.2k
Concord NC USA
Ann2.2k wrote:

Also give Sailfish a try.

See:

http://www.cs.cmu.edu/~ckingsf/software/sailfish/

Link to recently published Sailfish paper: http://www.nature.com/nbt/journal/vaop/ncurrent/full/nbt.2862.html

 

ADD COMMENTlink written 4.9 years ago by Ann2.2k

Sailfish has been massively updated in the past month; the underlying algorithm has been improved considerably.  The newest release is markedly faster, uses fewer resources, and is more accurate than the previous stable version.  Some interesting new features (e.g. variance estimation of transcript expression via Gibbs sampling and bootstraping) have also been added.  If you haven't tried out the new version, I recommend you give it a spin!

ADD REPLYlink written 3.4 years ago by Rob3.2k
1
gravatar for fo3c
6.1 years ago by
fo3c420
.eu
fo3c420 wrote:

Have you seen MMSEQ?

Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads

ADD COMMENTlink written 6.1 years ago by fo3c420

It is an interesting paper which I hadn't seen, but it seems to be about isoform level quantification and not about isoform level differential expression analysis as the question specifies

ADD REPLYlink written 6.1 years ago by Mikael Huss4.6k
0
gravatar for vipin.ts
6.1 years ago by
vipin.ts60
New York City
vipin.ts60 wrote:

I think rquant will perform isoform level quantification bioweb.me/rquant

ADD COMMENTlink modified 6.1 years ago • written 6.1 years ago by vipin.ts60
0
gravatar for Leszek
6.1 years ago by
Leszek4.0k
IIMCB, Poland
Leszek4.0k wrote:

In ENCODE RNA-Seq pipeline they used Flux Capacitor to estimate isoform expression.
I had difficult time running it though.

ADD COMMENTlink written 6.1 years ago by Leszek4.0k

Anyone evaluating Flux Capacitor (which I have not myself used) might be interested in this blog post from Lior Patcher on the subject.
 

ADD REPLYlink written 4.9 years ago by David Quigley11k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 829 users visited in the last hour