Question: Cross-Correlation On Chip-Seq Like Data?
gravatar for Alastair Kerr
9.9 years ago by
Alastair Kerr5.3k
Manchester/UK/Cancer Biomarker Centre at CRUK-MI
Alastair Kerr5.3k wrote:

Has anyone looked into the similarity of two samples of chip-seq like data using cross-correlation or auto-correlation? Basically I wish to get some sort of metric that describes how similar two wiggle files are on a genome-wide scale. Or inversely how much variance in one sample can be described by looking at the pattern of distribution in the other. If you have, what software or library did you use or alternatively what different approach have you used?

wiggle chip-seq statistics • 4.1k views
ADD COMMENTlink modified 9.9 years ago by Jeremy Leipzig19k • written 9.9 years ago by Alastair Kerr5.3k

The way I understand it is that cross-correlation gets you the lag between signals; autocorrelation gets the periodicity within a single signal. Do you mean plain 'ol correlation, like on a bp-by-bp or window-by-window basis?

ADD REPLYlink written 9.9 years ago by Ryan Dale4.9k

Good points. I was hoping that cross-correlation would not require there to be a periodicity and the signal itself would be enough for a measurement of similarity (I am still not clear on this). Regular correlation/regression might work I guess but I am concerned about the quantity of data for this approach. Perhaps a 100bp window per chromosome is the only way and avoid comparing two null or two background signals as per Jeremy's point below.

ADD REPLYlink written 9.9 years ago by Alastair Kerr5.3k
gravatar for Dawe
9.9 years ago by
Dawe270 wrote:

I'm kind of interested in this topic. I'm developer of a package for digital signal processing of quantitative genomic data (such as ChIP-seq). Cross-correlation is something I was implementing too, although I had some issues with the border effect, and also I was not sure xcorr is a proper measure of similarity for two ChIPs. In addition, subtracting one chip from the other (after energy normalization) may have the same effect. d

ADD COMMENTlink written 9.9 years ago by Dawe270
gravatar for Jeremy Leipzig
9.9 years ago by
Philadelphia, PA
Jeremy Leipzig19k wrote:

Seems like every time someone tries to use correlation to measure something like this the 0-0 effect throws things off. I prefer just calling the peaks and counting overlaps (by findOverlaps in R IRanges or using BEDTools).

ADD COMMENTlink written 9.9 years ago by Jeremy Leipzig19k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1682 users visited in the last hour