Entering edit mode
4.3 years ago
elmahy2005 ▴ 140
Given a dataset of RNA-Seq expression values normalized with the FPKM method, Is it possible to restore the original count dataset or create a new dataset that behaves very similar to the original count matrix (i.e. we can use in Poisson distribution based models)?
Unfortunately, it is not possible to calculate raw counts from RPKM data. Best is to start with bam files, and use a program such as
featureCountsto generate raw counts.
Agree on this, because after normalization who knows how the original values were modified.