Entering edit mode
7.2 years ago
jack
▴
980
Hey all,
I have normalised read count from DESeq2 for miRNA. I would like to compute TPM out of this normalised read count. I was wondering whether would it be possibile to do that ?
I second this. You calculate TPM to eliminate the length-bias (longer genes get more reads), but I don't think this doesn't make sense for miRNA sequencing.
Also: if you have normalised read counts from DESeq2 why would you want to get TPM values?
Need clarification:
Isn't that the biological variation you are trying to normalize (independent to the length a gene can be can be poorly/highly transcribed depend on the condition) ?
Does it matter when you do between sample comparisons where we compare one/same gene in different samples/biological conditions assuming same library sizes (not like within sample where length matters because comparing different genes from the same sample/biological condition) ?
I'm not sure what you mean to say.
I don't consider read length "biological" but a technical element.
I would say gene length doesn't matter when doing differential expression analysis between genes. However, when comparing transcript usage it does play a role.
The point I wanted to make (poorly, in hindsight) is that when you have DESeq2 normalized counts (which I consider superior to TPM) there is no real need to use something else (except if OP has good reasons to do so - but then still normalization of normalized counts is not a good idea as pointed out by geek_y).
I was not talking about the read length, it was about the number of reads mapped to a transcript/gene. Sorry I did not notice your last point.
I agree with you for differential expression analysis it is better to consider normalization from DESeq2/edgeR (always superior, whatever packages designed to handle RNA-seq data). I guess OP might consider converting raw counts to TPM only if want to check the expression levels without going for diffExp.
Very bad idea.