11 weeks ago
rajdeepboral00
I have a TPM matrix of my samples from RNA-Seq data, and I want to plot the expression of a particular gene, let's say X. Should I plot the TPM values directly, or is some kind of scaling necessary? Also, which test should I apply to check for statistical significance?
Thank you in advance.
To add to what @ATPoint was saying:
TPM is fine for comparing the expression of two genes within a single condition (or at least, it is probably the best thing we have). I'd recommend using log TPM for this type of thing. It's not entirely clear what the correct distribution for log TPM is, but t-tests probably work well enough in this situation.
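A minimal sketch of that within-condition comparison, using only the Python standard library. The TPM values are made up, and the pseudocount of 1 and the choice of a paired t statistic (both genes are measured in the same samples) are my assumptions, not something fixed by the advice above.

```python
import math
import statistics

def log2_tpm(values, pseudocount=1.0):
    """Return log2(TPM + pseudocount); the pseudocount (an assumption) avoids log2(0)."""
    return [math.log2(v + pseudocount) for v in values]

def paired_t_statistic(a, b):
    """Paired t statistic for two genes measured in the same samples."""
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1 denominator)
    return mean_diff / (sd_diff / math.sqrt(n))

# Hypothetical TPM values for two genes in the same four samples of one condition.
gene_x_tpm = [3.0, 7.0, 15.0, 7.0]
gene_y_tpm = [1.0, 1.0, 3.0, 0.0]

log_x = log2_tpm(gene_x_tpm)
log_y = log2_tpm(gene_y_tpm)
t_stat = paired_t_statistic(log_x, log_y)
print(f"paired t statistic on log2(TPM + 1): {t_stat:.3f}")
```

The statistic would then be compared against a t distribution with n - 1 degrees of freedom to get a p-value.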
TPM is not good for comparing the expression of a gene across two or more conditions. For this use proper count modeling with DESeq2/edgeR/limma-voom.
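As a rough illustration of the normalization these count-based tools apply before modeling, here is a sketch of median-of-ratios size factors, which is the default approach in DESeq2. This is a simplified stand-in written in plain Python, not the package's actual code; the gene names and counts are hypothetical.

```python
import math
import statistics

def size_factors(counts):
    """counts: dict of gene -> list of raw counts per sample.
    Returns one size factor per sample (median of ratios to the per-gene geometric mean)."""
    n_samples = len(next(iter(counts.values())))
    log_ratios = [[] for _ in range(n_samples)]
    for gene_counts in counts.values():
        if any(c == 0 for c in gene_counts):
            continue  # a zero count makes the geometric mean 0, so the gene is skipped
        log_geo_mean = sum(math.log(c) for c in gene_counts) / n_samples
        for j, c in enumerate(gene_counts):
            log_ratios[j].append(math.log(c) - log_geo_mean)
    return [math.exp(statistics.median(r)) for r in log_ratios]

# Hypothetical raw counts: sample 2 was sequenced twice as deeply as sample 1.
raw = {
    "geneA": [10, 20],
    "geneB": [100, 200],
    "geneX": [30, 60],
    "geneZ": [0, 5],
}

sf = size_factors(raw)
normalized_x = [c / s for c, s in zip(raw["geneX"], sf)]
print("size factors:", sf)
print("normalized counts for geneX:", normalized_x)
```

In this toy data the only difference between samples is sequencing depth, so the normalized counts for geneX come out equal in both samples.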
I just wanted to know if this is a correct way to plot the expression of my gene of interest. I ran DESeq2 and used its normalized counts to plot my gene of interest. Also, is the y-axis appropriate?
Yes, the y-axis is reasonable, although the test for differences should come from DESeq2 rather than from a Wilcoxon test in this case. With four replicates in each condition, I'd say a Wilcoxon test has almost no room to find a significant difference: even perfectly separated groups give a two-sided p-value of only about 0.029, which will not hold up once you correct for testing many genes.
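To see why a rank-based test has so little room with four replicates per group, you can compute the smallest p-value the exact Wilcoxon rank-sum test can produce. A minimal sketch, assuming no ties and an exact two-sided test (the function name is mine):

```python
from math import comb

def min_two_sided_rank_sum_p(n1, n2):
    """Smallest exact two-sided p-value of a Wilcoxon rank-sum test without ties.

    The most extreme outcome (complete separation of the groups) is one of
    comb(n1 + n2, n1) equally likely rank assignments under the null; the
    two-sided p-value doubles that tail probability.
    """
    return 2 / comb(n1 + n2, n1)

p_min = min_two_sided_rank_sum_p(4, 4)
print(f"4 vs 4 replicates: minimum p = {p_min:.4f}")
```

With 4 vs 4 this is 2/70, so the best case sits just under 0.05 and cannot survive any multiple-testing correction across genes.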
Scaling is only necessary if you have genes with different expression levels, so you can probably go ahead. For differential expression analysis of RNA-seq data there are packages such as limma, edgeR and DESeq2. Starting from TPM is not a good idea; this has been asked many times before, so please search for it.