Hi,
I have been trying to find the difference between the above two online for a while now, but I haven't got a satisfactory answer. I also didn't find a similar question on Biostars, so I thought of formally asking it now.
Tximport (and maybe other tools too) gives a couple of outputs for each gene, and two of them are - abundance and counts. What is the difference between them?
This paper gives a general idea that count based methods assign reads to genes directly, whereas abundance based methods assign abundance of each transcript with a probabilistic model that makes use of info such as fragment length distribution etc.
So, having said that, is this really the difference between the abundance and count values that I get for any gene from Tximport (or any tool in general)? And, in which situation is one of them a more meaningful/desirable quantity?
Abundance just means a quantification of the expression level. Raw counts without any kind of normalization is not a very accurate measure of abundance, but many software tools want raw counts for input because they do their own normalization. This is probably why Tximport has both, but I don't know the exact method Tximport uses to calculate abundance. In general, abundance could be TMM-normalized counts, TPM values, or any other kind of gene expression measure.
Have you checked the "Use with downstream Bioconductor DGE packages" section of the tximport vignette? That part addresses this question
Hi igor, thanks for the response. Yes, I had gone through that section before and went through it again now, but I didn't find any clear explanation for the difference between abundance and counts in general