Question: Converting TPM to FPKM
1
22 months ago by
alexahes10
alexahes10 wrote:

Hello.

I have an expression dataset in TPM values that I want to convert to FPKM. The problem is that I have no idea how to do it and it seems like my google-skills are not good enough to find an answer either.

Does anyone here have any experience with converting TPM values to FPKM?

Many thanks!

rna-seq tpm fpkm • 1.3k views
modified 6 weeks ago by Charles Warden6.4k • written 22 months ago by alexahes10
1

Why would you do this, rather than converting FPKM to TPM? There is no benefit to FPKM over TPM and the FPKM to TPM conversion is trivial.

1

There are benefits of FPKM to TPM. FPKM values of the same gene between different samples are proportional to RPM values of that gene, which is very useful for RPM believers, who value more about the mass of a massenger than the molar concentration when comparing different samples. TPM isn't superior to FPKM. Both of them are just estimations dependent on your prior beliefs.

3
22 months ago by
karl.stamm3.4k
United States
karl.stamm3.4k wrote:

Any conversion will have some error to it. Theoretically, the math is a quick conversion, but realistically, both of your data sets are produced by different tools with different biases. The conversion is not going to be correct. Better to try to use both sorts of data, knowing they are simply different. Maybe log-normalization can get you into similar scales without explicitly worrying about units.

1
6 weeks ago by
brianpenghe20
United States
brianpenghe20 wrote:

It is always harder to convert TPM back to FPKM because TPM lost information that FPKM still contains. If you really want to do that, you have to find the normalizing constant, i.e., the ratio between FPKM and TPM since these two values are proportional to each other in the same sample.

I'd suggest getting the raw reads to derive FPKM.

0
6 weeks ago by
Charles Warden6.4k
Duarte, CA
Charles Warden6.4k wrote:

I think everybody has essentially conveyed the right idea: you would typically have those (TPM versus FPKM) with different quantification methods (transcript quantification versus gene quantification).

If you have read counts for transcripts for a transcript TPM value, you could add those at the gene level (which is how you could then calculate FPKM values from the counts provided alongside your TPM values, with your transcript quantification). However, that involved assignment of ambiguous reads between transcripts and multi-mapped reads (which could be between genes). So, your FPKM values with unique reads won't be the same.