Question: Significance Analysis Of Chip Seq Overlaps
7.2 years ago by
revival78650 wrote:

Suppose I have binding regions for two transcription factors, A and B.

A has 5,000 binding regions, B has 10,000 binding regions, and 500 overlaps are observed. How do I calculate the significance of this overlap? [No peak data is available.]

statistics • 3.2k views
written 7.2 years ago by revival78650
7.2 years ago by
Ido Tamir (5.0k) wrote:

This is a duplicate of "How Do You Calculate If Two Sets Of Genomic Regions Overlap Significantly?", where some answers are given. For completeness:

Here is a review of some resampling methods (the Bioconductor package is still not released).

The ENCODE GSC (genome structure correction) from the ENCODE trial project has still not been published independently in a paper, but some Python source code is available here (the original was in MATLAB, but could be made to run in Octave):

You cannot answer this without peak data, because the significance of the overlaps is inversely related to the size of the peaks: the larger the peaks, the more likely it is that two of them overlap by chance. Mappability/detectability is also crucial.
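To make the peak-size point concrete, here is a rough Python sketch of a naive permutation (random placement) null. Everything beyond the three counts from the question is an assumption made for illustration: a single linear genome of 3e9 bp, all peaks sharing one fixed width, and uniform placement that ignores mappability and chromosome boundaries. The function names are hypothetical. It is not GSC and not a substitute for a test on the real peak coordinates.

```python
import random
from bisect import bisect_left


def count_overlaps(a_starts, b_starts_sorted, width):
    """Count A intervals [s, s+width) that overlap at least one B interval."""
    n = 0
    for s in a_starts:
        # A B interval [t, t+width) overlaps iff s - width < t < s + width.
        i = bisect_left(b_starts_sorted, s - width + 1)
        if i < len(b_starts_sorted) and b_starts_sorted[i] < s + width:
            n += 1
    return n


def simulate_null(n_a, n_b, width, genome_len, n_iter, seed=0):
    """Overlap counts when both peak sets are placed uniformly at random."""
    rng = random.Random(seed)
    counts = []
    for _ in range(n_iter):
        a = [rng.randrange(genome_len - width) for _ in range(n_a)]
        b = sorted(rng.randrange(genome_len - width) for _ in range(n_b))
        counts.append(count_overlaps(a, b, width))
    return counts


def empirical_p(null_counts, observed):
    """One-sided empirical p-value with the usual +1 correction."""
    ge = sum(c >= observed for c in null_counts)
    return (ge + 1) / (len(null_counts) + 1)


GENOME_LEN = 3_000_000_000  # assumed genome size, not given in the question
OBSERVED = 500              # overlaps reported in the question

# Same counts, two assumed peak widths (bp); few iterations to keep it quick.
null_by_width = {}
for width in (200, 20_000):
    null = simulate_null(5_000, 10_000, width, GENOME_LEN, n_iter=20)
    null_by_width[width] = null
    mean = sum(null) / len(null)
    print(f"width={width}: mean chance overlaps={mean:.1f}, "
          f"empirical p={empirical_p(null, OBSERVED):.3f}")
```

Under these assumptions, 200 bp peaks give only a handful of chance overlaps, so 500 observed would look strongly enriched, while 20 kb peaks give several hundred chance overlaps, so the same 500 would not. The counts alone therefore cannot settle the question.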

written 7.2 years ago by Ido Tamir (5.0k)

OK, what about using a permutation test in R [like the coccur package] while going without peak data? Where should I start, as I am a beginner at R?

written 7.1 years ago by revival78650
