Question: Enrichment Test between Genomic and Epigenomic Regions
gravatar for German.M.Demidov
3.2 years ago by
German.M.Demidov1.5k wrote:

Hi all,

I have detected some interesting epigenetic regions on the genome (for example, differentially methylated regions) and I have some features on the genome (for example, I classify all the genome to 3'UTR/5'UTR/gene body/promoter/enhancer/junk region).

Different features have different lengths (i.e. junk regions occupy the majority of the genome).

How to say if my set of intervals - diff methylated regions - is enriched in one of the features? One interval can have intersection with several features. I know how to apply a proportion test, I do not know how to normalise number of intersections between features and intervals for the length of features.

enrichment • 844 views
ADD COMMENTlink modified 3.0 years ago by Biostar ♦♦ 20 • written 3.2 years ago by German.M.Demidov1.5k

Careful with the idea of 'junk' regions. It's thought that methylation is actually majorly involved in the repression of transposable elements, that make up about 50% of the human genome and the vast majority of other genomes as well. It's entirely possible that this junk region is in fact mostly composed of insulator/TEs.

You could try a hyper geometric test to test for enrichment.

ADD REPLYlink modified 3.2 years ago • written 3.2 years ago by Sinji3.0k

Agree with everything, called it "junk" just in order not to specify all 30 options. It is absolutely clear how to use hypergeometric test (i.e. ) while having a control dataset, but I do not understand how to work in terms of features/intervals only, what is "Population", what is "Sample" and what is a success in this case.

ADD REPLYlink written 3.2 years ago by German.M.Demidov1.5k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1763 users visited in the last hour