Question: Number of shared biological GO terms
gravatar for qwzhang0601
5.3 years ago by
United States
qwzhang060170 wrote:

I have a group of genes and want to test whether the gene pairs from my list tend to involve simultaneously in more biological process than that of gene pairs from all protein coding genes. So I want to count the number of BP GO terms shared by each gene pairs. However, I am afraid that there will be bias to better studied genes, since the hierarchical structure of GO terms (if two genes share a child GO terms, they will share all higher-level GO terms). Is there a way to reduce such bias? Thanks


go terms • 1.4k views
ADD COMMENTlink modified 5.3 years ago by Biostar ♦♦ 20 • written 5.3 years ago by qwzhang060170

I am not sure if this is possible but maybe you can try to perform permutation using genes that have equal levels of BP terms.

E.g. Gene A and Gene B is a pair and they each have 10 and 30 BP GO terms respectively. Then you can randomly select genes that have 10 and 30 BP GO terms and see if the number of GO terms shared among them are higher or lower than that of the pair you are studying. This might give you an empirical p-value as an reference. However, this will only work if there are a lot of other genes that have equal (or more) BP GO terms as the one that you are studying or else you will not be able to generate a good enough null distribution for the data....

ADD REPLYlink written 5.3 years ago by Sam3.0k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1126 users visited in the last hour