Question: Is Stringtie a suitable tool for both gene and transcript quantification?
0
gravatar for tianshenbio
4 months ago by
tianshenbio50
tianshenbio50 wrote:

I am thinking about generating read count matrix at both gene-level and transcript (isoform)-level.

According to a previous post:

Why run FeatureCounts after Stringtie? (Galaxy recommends!)

How to get read counts on transcript level using featurecounts?

It seems that I can use FeatureCounts for gene quantification and Stringtie for transcript/isoform quantification, am I right?

Since transcripts are heavily overlapping, featurecounts cannot properly sort out reads mapping to the same exon, thus is not suitable to count transcripts/isoforms. Then how this can be overcome in Stringtie? Are common reads properly sorted using stringtie?

Many people suggested an alignment-free tool, Salmon, for transcript quantification. Since I am interested to find both DE genes and DE transcripts/isoforms in my DE analysis, I assume Stringtie would be a more handy option since I can get both gene and transcript counts in one run.

Therefore, my question would be, is gene/transcript quantification reliable using Stringtie? How does it distribute common reads shared by multiple isoforms, which is the major problem to quantify isoforms.

I have read the original papers and related posts here in biostars but still not sure...appreciate it if someone can clarify this for me.

ADD COMMENTlink modified 4 months ago by i.sudbery9.4k • written 4 months ago by tianshenbio50
6
gravatar for i.sudbery
4 months ago by
i.sudbery9.4k
Sheffield, UK
i.sudbery9.4k wrote:

Every benchmark I have seen (as well as my own experence) shows that StringTie is less accurate that Salmon/kalisto/RSEM. I don't actaully know what the model at the heart of StringTie's quantification is. Salmon/kalisto/RSEM all use some variation of EM to distribute reads between transcripts.

As @ATpoint points out, its fairly easy to calculate gene expression from transcript expression.

ADD COMMENTlink written 4 months ago by i.sudbery9.4k

Thank you for your response. Will give it a try!

ADD REPLYlink written 4 months ago by tianshenbio50
4
gravatar for ATpoint
4 months ago by
ATpoint40k
Germany
ATpoint40k wrote:

You can easily aggregate the salmon transcript level abundance estimates to the gene level with the tximport package from Bioconductor. I would definitely go with salmon. From what I understand stringtie is mainly used to assemble reads into a transcriptome and I probably would only use it for that.

ADD COMMENTlink modified 4 months ago • written 4 months ago by ATpoint40k

Thank you for your reply. Yeh, since I will not perform transcriptome assembly, Salmon might be a better choice.

ADD REPLYlink written 4 months ago by tianshenbio50
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1273 users visited in the last hour