RPM data analysis with DESeq2
1
0
Entering edit mode
5.0 years ago
ruhollah ▴ 10

I have a table of RPM data for gene expressions of all human genes in two groups of patients. (each group includes several samples)

I am going to use DESeq2 to explore if there is any gene whose expression significantly changes between two groups.

DESeq2, however, seems like accepts only nu-normalized counts for the RNA seq. Is there a way to feed the DESeq2 with RPM data?

DESeq2 RPM data • 3.5k views
ADD COMMENT
0
Entering edit mode

Cross-posted on Bioconductor: https://support.bioconductor.org/p/120164/

ADD REPLY
1
Entering edit mode
5.0 years ago

I think DESeq2 expects counts, so passing it some normalized values (like TPM, RPKM, FPKM) is probably not optimal.

Do you have RPKM, or RPM (Reads-per-million)? If you have the latter, you can convert them to RPKM by dividing each RPM by the length of the gene in kilobases. If you want to get counts, you could multiply your RPM by (Number of reads in Sample / 1 000 000) and that should give you counts, which you should be able to use with DESeq. Make sure you know whether you have RPMs or RPKMs.

See also: https://support.bioconductor.org/p/56275/#56299, which suggests converting back to count preferably, or if you're stuck with RPKM then use limma instead of DESeq2.

Also, handy reference on RPKM, FPKM, TPM measures: https://www.rna-seqblog.com/rpkm-fpkm-and-tpm-clearly-explained/

ADD COMMENT
1
Entering edit mode

Thank you so much for clarification! I got the count data. I will work on it. This is my first experience w/ DESeq though

ADD REPLY

Login before adding your answer.

Traffic: 2749 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6