Extract RPKM values from Diffbind
2
1
Entering edit mode
3.8 years ago
francesca3 ▴ 140

Hi everyone, does anyone know if there is a way to extrapolate rpkm values from Diffbind?? Thanks Francesca

ChIP-Seq Diffbind RPKM • 1.4k views
ADD COMMENT
2
Entering edit mode
3.8 years ago
Rory Stark ★ 2.0k

You can set the read score to DBA_SCORE_RPKM in dba.count(). You can do this when you originally count, or anytime after by calling

DBA <-  dba.count(DBA,peaks=NULL,score=DBA_SCORE_RPKM)

Then you can retrievethe RPKM values in a GRanges object (or data.frame):

rpkm <- dba.peakset(DBA, bRetrieve=TRUE)
ADD COMMENT
1
Entering edit mode

Thank you Rory. I have another question. When I retrieve these RPKM values I obtain very low numbers (ranging from around 0.5 to 2). Which values should I expect for trusted regions? If I retrieve TMM values (DBA_SCORE_TMM_MINUS_FULL) these values range from around 10 to 200. Usually I was counting using DBA_SCORE_TMM_MINUS_FULL. Thank you.

ADD REPLY
2
Entering edit mode
3.8 years ago
Rory Stark ★ 2.0k

RPKM values are much lower in general than read counts. The first step is to divide by how many million sequence reads you have in your library. So if you sequenced to a depth of 20M reads, all the counts would be divided by at least 20.

Here's how it's implemented in DiffBind:

rpkm    <- (counts/(width(intervals)/1000))/(libsize/1e+06)
ADD COMMENT
0
Entering edit mode

are counts the same as pileup in macs2?

I guess, more broadly, my question would probably be: what field in an excel report from a peakcaller like macs2 is the count number?

Secondly, for the calculation of rpkm of each sample intervals are taken as-is from the peakcaller or it uses the consensus peakset, calculated by DiffBind?

ADD REPLY

Login before adding your answer.

Traffic: 2437 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6