Question: Computing TPM from normalized read count from DESEQ
0
gravatar for jack
20 months ago by
jack750
Germany
jack750 wrote:

Hey all,

I have normalised read count from DESeq2 for miRNA. I would like to compute TPM out of this normalised read count. I was wondering whether would it be possibile to do that ?

ADD COMMENTlink modified 20 months ago by geek_y9.4k • written 20 months ago by jack750
2
gravatar for geek_y
20 months ago by
geek_y9.4k
Barcelona/CRG/London/Imperial
geek_y9.4k wrote:

TPM itself is a normalisation method, so you do it on raw counts, not on normalised data. If you want to know how to calculate TPM, check this link.

http://www.rna-seqblog.com/rpkm-fpkm-and-tpm-clearly-explained/

I am not sure if you could apply TPM for miRNA.

ADD COMMENTlink modified 20 months ago • written 20 months ago by geek_y9.4k

I am not sure if you could apply TPM for miRNA.

I second this. You calculate TPM to eliminate the length-bias (longer genes get more reads), but I don't think this doesn't make sense for miRNA sequencing.

Also: if you have normalised read counts from DESeq2 why would you want to get TPM values?

ADD REPLYlink written 20 months ago by WouterDeCoster38k

Need clarification:

longer genes get more reads

Isn't that the biological variation you are trying to normalize (independent to the length a gene can be can be poorly/highly transcribed depend on the condition) ?

Does it matter when you do between sample comparisons where we compare one/same gene in different samples/biological conditions assuming same library sizes (not like within sample where length matters because comparing different genes from the same sample/biological condition) ?

ADD REPLYlink modified 20 months ago • written 20 months ago by EagleEye6.2k

I'm not sure what you mean to say.

Isn't that the biological variation you are trying to normalize (independent to the length a gene can be can be poorly/highly transcribed depend on the condition) ?

I don't consider read length "biological" but a technical element.

Does it matter when you do between sample comparisons where we compare one/same gene in different samples/biological conditions assuming same library sizes (not like within sample where length matters because comparing different genes from the same sample/biological condition)?

I would say gene length doesn't matter when doing differential expression analysis between genes. However, when comparing transcript usage it does play a role.

The point I wanted to make (poorly, in hindsight) is that when you have DESeq2 normalized counts (which I consider superior to TPM) there is no real need to use something else (except if OP has good reasons to do so - but then still normalization of normalized counts is not a good idea as pointed out by geek_y).

ADD REPLYlink written 20 months ago by WouterDeCoster38k
1

I don't consider read length "biological" but a technical element.

I was not talking about the read length, it was about the number of reads mapped to a transcript/gene. Sorry I did not notice your last point.

Also: if you have normalised read counts from DESeq2 why would you want to get TPM values?

I agree with you for differential expression analysis it is better to consider normalization from DESeq2/edgeR (always superior, whatever packages designed to handle RNA-seq data). I guess OP might consider converting raw counts to TPM only if want to check the expression levels without going for diffExp.

normalization of normalized counts is not a good idea

Very bad idea.

ADD REPLYlink written 20 months ago by EagleEye6.2k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2327 users visited in the last hour