Hypergeometric Test on Two Mutational Spectra
1
0
Entering edit mode
9.7 years ago

I have two mutational spectra that I need to compare to see if they are significantly different from one another. Each column of the data below illustrates how many mutations were found at a given location in each sample. I am aware that Scipy has hypergeometric functions that will suit my needs but with my limited stats knowledge I am having a hard time distilling this raw information into the function to acquire the p-value.

Ultimately I am trying to write a script using Biopython/Scipy/Numpy to do what is outlined in the journal article Statistical Test for the Comparison of Samples from Mutational Spectra by W. Thomas Adams and Thomas R. Skopek in 1986.

Here is a sample data-set to work with that resembles the data I am looking at.

Pos.        Sample 1         Sample 2
2           0                3
3           0                0
6           0                1
8           0                0
12          2                5
15          1                1
26          1                0
34          2                0
47          0                2
77          4                4

Let me know if more details need to be provided!

mutation statistics hypergeometric spectra python • 1.9k views
ADD COMMENT
0
Entering edit mode

To give an update on my progress. I am developing a script to handle the Adams & Skopek algorithm to answer my own question. I will post my answer once it's complete and working.

ADD REPLY
0
Entering edit mode
9.7 years ago

Joint allele-frequency spectrum are cool. Are you aware of this paper? The program associated with the paper (DADI) might have the test you are looking for. If not the paper is still a good read.

ADD COMMENT
0
Entering edit mode

I'll give it a read and see if I can get anywhere. Thanks for the tip.

ADD REPLY

Login before adding your answer.

Traffic: 1773 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6