Question

Chip-Seq Ratio Intensity And Normalization

1

Entering edit mode

12.5 years ago

Snape_Ar ▴ 10

I am interested in the data analysis of ChIP-Seq data. I am interested to know how to calculate the following:

Log2 Intensity Ratios and what does it mean ? Also, the ratio is between the replicates or between a background and a replicate and how do you calculate the intensity value for each sample?
I know how the local regression (lowess) works but why is smoothing needed for ChIP-Seq data. Is it needed to remove the non-specific binding peaks ?
What does a genome-wide mean in the ChIP-Seq sample means ?

The questions are based on the Paper that was published in Nature and link of the paper is as follows: http://www.nature.com/nature/journal/v471/n7339/full/nature09725.html.

Any hints or advice will be highly appreciated.

Thanks.

chip-seq statistics data • 4.2k views

ADD COMMENT • link updated 12.1 years ago by brentp 24k • written 12.5 years ago by Snape_Ar ▴ 10

1

Entering edit mode

FYI, the in paper you refer to, they used ChIP-chip (tiling microarrays) -- but in your question you ask about ChIP-seq

ADD REPLY • link 12.5 years ago by Ryan Dale 5.0k

Ram · Answer 1 · 2011-10-26

1

Entering edit mode

12.5 years ago

brentp 24k

The log-ratio is the ratio of the 2 channels in the microarray. See, e.g.: http://en.wikipedia.org/wiki/DNA_microarray#Two-channel_vs._one-channel_detection. Nearly all software packages will do this and other normalization steps for you.
Since the probes are often overlapping (http://en.wikipedia.org/wiki/Tiling_array), lowess is used to smooth adjacent / overlapping peaks.
??

ADD COMMENT • link updated 4.6 years ago by Ram 43k • written 12.5 years ago by brentp 24k

0

Entering edit mode

I understand the log-ratio of 2 channels in microarray but how do you calculate it with the ChIP-Seq data.
Thanks.
I think even though the total intensities are not very well normalized. So after calculating the intensity values,and shift them so that the mean intensity value across the genome is 0. In other words, we adjust the intensity values by subtracting the mean of intensity values across all tiled regions of the genome.

But my question still remains how to calculate the intensity values ?

Anyways thanks. I appreciate it.

ADD REPLY • link 12.5 years ago by Snape_Ar ▴ 10