Hi!
I was wondering if their is a way to normalise the number of reads in a region and the RPKM of the nearest gene to that region, so that a correlation could be computed.
Like the following data shows number of tags in first column and RPKM in second column
Tags     RPKM
15        0.14619
11        0
203        0.2259
129        10.701
300        7.0772
122        2.3234
346        10.666
77        3.117
201        16.749
What is the most potent way to normalise/scale the data in two columns so that a correlation can be computed?
Thank you
Wouldn't that just be computing the RPKM for your region?
The aim is to correlate occupancy with RPKM