How Bimodal Is A Methylation Distribution?
4
5
Entering edit mode
13.0 years ago
Pablo ★ 1.9k

Analysing methylation from a sequencing experiment, we seem to be having some bimodal distributed values.

The question I was posed (which I'm still not so sure whether makes sense or not) is "Are you sure this is bimodal?". Needless to say, the plot doesn't show a clear "yes" or "no" answer. I.e. it's neither two well defined peaks nor one single peak, but something in between. So the questions are:

1- Is there a way to assign a p-value (or any metric) on how bimodal a distribution is?

2- Does it even make sense to ask this question? (Why or why not?).

Apologies in advanced if the question is too off topic.

statistics methylation • 5.3k views
ADD COMMENT
6
Entering edit mode
13.0 years ago

1) The advice given in this CrossValidated thread seems solid. Really, if you've got a hairy stats question, those are the guys to ask.

2) Bimodal distribution of methylation scores is is common, from what I understand. Here's a figure showing a distribution of methylation scores in CpG islands, taken from the supplement of this paper:

Harris, et al. Comparison of sequencing-based methods to profile DNA methylation and identification of monoallelic epigenetic modifications

Nature Link: http://www.nature.com/nbt/journal/v28/n10/full/nbt.1682.html

PMC Link: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2955169/?tool=pubmed

from supp

ADD COMMENT
1
Entering edit mode
13.0 years ago

A google search on testing for modality picks up this bit of advice

From work I am involved in on CpG islands I guess that your trends should be bimodal, it can be seen clearly in CpG island methylation in human and in full transcript methylation in Ciona.

There will be a Deaton et al paper coming out in Genome Research later in the year that will also show that the 'CpG shores' hypothesis is less likely.

ADD COMMENT
0
Entering edit mode

The reference you gave is about someone who comments on a method based on a package that he never used or even looked at.

Sorry, but it does seem a little bit too vague (plus it doesn't answer my question).

Also, minus one for referencing Google.

ADD REPLY
0
Entering edit mode
13.0 years ago
Genotepes ▴ 950

Very preliminary comment before heading home,

looks like a mixture distributions problem. If you data looks normal, then an EM algorithm and likelihood ratio test could solve this.

I am sure this is not quite so easy.

Christian

ADD COMMENT
0
Entering edit mode
12.6 years ago

Actually, I get similar distribution from HumanMethylation450 data.

ADD COMMENT

Login before adding your answer.

Traffic: 1941 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6