Is calcuNormFactors necessary in the calculation of RPKM?
1
1
Entering edit mode
4.8 years ago
wangdp123 ▴ 340

Hi there,

It seems that there are two ways of calculating RPKM values based on rpkm function of edgeR from the output of featurecounts. One uses calcuNormFactors and the other doesn't.

I wonder which one is more appropriate for generating RPKM values from RNA-Seq analysis? Why?

1)

x <- DGEList(counts=fc$counts, genes=fc$annotation[,c("GeneID","Length")])
x_rpkm <- rpkm(x,x$genes$Length)

2)

y <- DGEList(counts=fc$counts, genes=fc$annotation[,c("GeneID","Length")])
y <- calcNormFactors(y)
y_rpkm <- rpkm(y,y$genes$Length)

Many thanks,

Tom

RNA-Seq edgeR RPKM • 1.4k views
ADD COMMENT
1
Entering edit mode
4.8 years ago
ATpoint 82k

It is the second one as this takes into account the TMM-derived scaling factors which, beyond sequencing depth also corrects for library composition changes. There is a video on YT on TMM normalization (StatQuest series) that explains this quite well.

ADD COMMENT

Login before adding your answer.

Traffic: 2575 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6