Question: Difference between abundance and counts in RNA-Seq
gravatar for c_u
4 weeks ago by
United States
c_u140 wrote:


I have been trying to find the difference between the above two online for a while now, but I haven't got a satisfactory answer. I also didn't find a similar question on Biostars, so I thought of formally asking it now.

Tximport (and maybe other tools too) gives a couple of outputs for each gene, and two of them are - abundance and counts. What is the difference between them?

This paper gives a general idea that count based methods assign reads to genes directly, whereas abundance based methods assign abundance of each transcript with a probabilistic model that makes use of info such as fragment length distribution etc.

So, having said that, is this really the difference between the abundance and count values that I get for any gene from Tximport (or any tool in general)? And, in which situation is one of them a more meaningful/desirable quantity?

rna-seq tximport • 269 views
ADD COMMENTlink modified 29 days ago • written 4 weeks ago by c_u140

Abundance just means a quantification of the expression level. Raw counts without any kind of normalization is not a very accurate measure of abundance, but many software tools want raw counts for input because they do their own normalization. This is probably why Tximport has both, but I don't know the exact method Tximport uses to calculate abundance. In general, abundance could be TMM-normalized counts, TPM values, or any other kind of gene expression measure.

ADD REPLYlink written 4 weeks ago by colin.kern750

Have you checked the "Use with downstream Bioconductor DGE packages" section of the tximport vignette? That part addresses this question

ADD REPLYlink modified 4 weeks ago • written 4 weeks ago by igor8.8k

Hi igor, thanks for the response. Yes, I had gone through that section before and went through it again now, but I didn't find any clear explanation for the difference between abundance and counts in general

ADD REPLYlink written 4 weeks ago by c_u140
gravatar for Devon Ryan
29 days ago by
Devon Ryan92k
Freiburg, Germany
Devon Ryan92k wrote:

A count is simply that, a count of reads on some feature. An abundance is a more biologically meaningful (though not necessarily statistically useful) quantification of expression of a gene or transcript that is normalized in some way. Most commonly in this is TPM or some variant of that, but it could also be "copies per cell", which would be an abundance metric you could get from rt-qPCR. In other words, normalized counts aren't an abundance estimate since reads aren't a thing present in the cell, but an artifact of how we perform library prep and sequencing. The exception to this would be if you use a minion or equivalent to sequence full-length transcripts, since then a normalized count would estimate the abundance (on some likely relative scale) of a transcript in a cell or tissue.

ADD COMMENTlink written 29 days ago by Devon Ryan92k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1167 users visited in the last hour