Entering edit mode
6 weeks ago
jeni ▴ 90
Imagine I have normalized expression values of two different transcripts from the same gene, but I am insterested in studying the expression at the gene level. Which is the best way to get this? Do you think it would be correct to calculate the mean of the expression levels from those two transcripts? What if a transcript is not expressed and other is?
tximport does something a bit more clever than simply summing up (it factors transcript length into consideration).
If you're given TPMs, you can simply sum those up.
tximport will calculate the weighted effective transcript length, but it doesn't use that in its calculation of gene level expression. Both counts and TPMs are just summed. The effective transcript lengths (which is sum(tx_expression*tx_length/gene_expression)) is then used as an offset in differential testing platforms.