TCGA expression data (TPM)
3.2 years ago
Folder40g ▴ 160

I'm using software that requires either TPM or raw counts at the gene level.

So I downloaded this data set from the Xena browser:

This matrix seems to be log2(RSEM output + 1).

Am I wrong in saying that if I take the antilog2 of the values in this matrix (2^value) and then subtract 1, I get TPM?
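For reference, the back-transformation described above can be sketched in Python with NumPy. This is only a minimal sketch: the function name `undo_log2_plus_one` and the toy matrix are hypothetical, and it assumes the matrix really stores log2(x + 1). Whether the recovered x is TPM depends on which RSEM output was transformed. One sanity check is that TPM values sum to roughly 1,000,000 per sample when all genes are included.

```python
import numpy as np

def undo_log2_plus_one(log_values):
    """Invert y = log2(x + 1), returning x = 2**y - 1."""
    values = np.exp2(np.asarray(log_values, dtype=float)) - 1.0
    # Floating-point error can leave tiny negatives where x was 0; clip them.
    return np.clip(values, 0.0, None)

# Hypothetical toy matrix: rows are genes, columns are samples.
toy_tpm = np.array([[0.0, 3.0],
                    [999999.0, 7.0],
                    [1.0, 999990.0]])
log_matrix = np.log2(toy_tpm + 1.0)

linear = undo_log2_plus_one(log_matrix)

# If the back-transformed values are TPM, each column should sum to about 1e6.
column_sums = linear.sum(axis=0)
looks_like_tpm = np.allclose(column_sums, 1e6, rtol=1e-3)
```

If the column sums are far from one million, the matrix more likely holds something other than TPM (for example expected counts, or a subset of genes).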

I haven't been able to find raw counts at the gene level. As far as I've seen, TCGAbiolinks only provides HTSeq counts at the transcript level.


TCGA TPM expression TCGAbiolinks • 4.4k views

TPM and RPKM/FPKM are highly correlated for gene-level quantification. TPM uses transcript length to normalize, FPKM uses gene length, and transcript length is highly correlated with gene length. My take is that both TPM and RPKM/FPKM are reasonable estimates of expression level. In my experience, switching between the two rarely gives drastically different results. Nonetheless, you probably want to be consistent across all the datasets under consideration so that you don't pull out signal that is due only to the difference in metric.
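To make the relationship between the two metrics concrete, here is a minimal Python sketch of the usual within-sample rescaling from FPKM to TPM. The function name `fpkm_to_tpm` and the toy values are hypothetical, and the sketch assumes the FPKM values cover all quantified genes and were computed from the same feature lengths that the TPM values would use.

```python
import numpy as np

def fpkm_to_tpm(fpkm):
    """Rescale each sample (column) of an FPKM matrix so it sums to 1e6."""
    fpkm = np.asarray(fpkm, dtype=float)
    return fpkm / fpkm.sum(axis=0, keepdims=True) * 1e6

# Hypothetical toy matrix: rows are genes, columns are samples.
toy_fpkm = np.array([[10.0, 0.0],
                     [30.0, 5.0],
                     [60.0, 15.0]])
toy_tpm = fpkm_to_tpm(toy_fpkm)
```

Under that assumption the two metrics differ only by a per-sample scaling factor, so gene rankings within a sample are unchanged. Comparisons across samples or datasets are where mixing metrics can matter.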

