Question: Difference between abundance and counts in RNA-Seq
gravatar for c_u
9 months ago by
United States
c_u250 wrote:


I have been trying to find the difference between the above two online for a while now, but I haven't got a satisfactory answer. I also didn't find a similar question on Biostars, so I thought of formally asking it now.

Tximport (and maybe other tools too) gives a couple of outputs for each gene, and two of them are - abundance and counts. What is the difference between them?

This paper gives a general idea that count based methods assign reads to genes directly, whereas abundance based methods assign abundance of each transcript with a probabilistic model that makes use of info such as fragment length distribution etc.

So, having said that, is this really the difference between the abundance and count values that I get for any gene from Tximport (or any tool in general)? And, in which situation is one of them a more meaningful/desirable quantity?

rna-seq tximport • 1.1k views
ADD COMMENTlink modified 9 months ago • written 9 months ago by c_u250

Abundance just means a quantification of the expression level. Raw counts without any kind of normalization is not a very accurate measure of abundance, but many software tools want raw counts for input because they do their own normalization. This is probably why Tximport has both, but I don't know the exact method Tximport uses to calculate abundance. In general, abundance could be TMM-normalized counts, TPM values, or any other kind of gene expression measure.

ADD REPLYlink written 9 months ago by colin.kern920

Have you checked the "Use with downstream Bioconductor DGE packages" section of the tximport vignette? That part addresses this question

ADD REPLYlink modified 9 months ago • written 9 months ago by igor11k

Hi igor, thanks for the response. Yes, I had gone through that section before and went through it again now, but I didn't find any clear explanation for the difference between abundance and counts in general

ADD REPLYlink written 9 months ago by c_u250
gravatar for Devon Ryan
9 months ago by
Devon Ryan96k
Freiburg, Germany
Devon Ryan96k wrote:

A count is simply that, a count of reads on some feature. An abundance is a more biologically meaningful (though not necessarily statistically useful) quantification of expression of a gene or transcript that is normalized in some way. Most commonly in this is TPM or some variant of that, but it could also be "copies per cell", which would be an abundance metric you could get from rt-qPCR. In other words, normalized counts aren't an abundance estimate since reads aren't a thing present in the cell, but an artifact of how we perform library prep and sequencing. The exception to this would be if you use a minion or equivalent to sequence full-length transcripts, since then a normalized count would estimate the abundance (on some likely relative scale) of a transcript in a cell or tissue.

ADD COMMENTlink written 9 months ago by Devon Ryan96k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1480 users visited in the last hour