Question

Testing independence of categorical variables in genomic data (TF binding and regulated loci)

0

9.6 years ago

annerionta93 ▴ 10

I have a question regarding testing the independence of two categorical variables in biology. I'll first explain it in biological terms and then more generally.

I have a list of downregulated loci and a list of binding events of a transcription factor (TF). How can I test whether my TF affects the regulation of the loci?

In more general terms, the genome is divided into windows, and each window is classified as yes or no for two categories. I want to test whether one category affects the other. However, in the contingency table most of the windows will be "no" for both categories.

At first I thought I would treat this with a chi-squared test, but I always end up rejecting the null hypothesis that the categories are independent. I think this is because the no/no cell is always very large. Here's a sample of what I see:

         TF_binding_event
   loci     no     yes
   no  2510067    1070
   yes    2736       4
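For reference, here is a minimal sketch in plain Python (standard library only) of what a chi-squared test works from on the table above: the counts expected under independence, the Pearson statistic without continuity correction, and the sample odds ratio. The variable names are illustrative, and reading the rows as "locus downregulated" and the columns as "TF binding event" is taken from the table labels.

```python
# Observed 2x2 table from the post.
# Rows: locus downregulated (no, yes); columns: TF binding event (no, yes).
observed = [[2510067, 1070],
            [2736, 4]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Expected count under independence: row total * column total / grand total.
expected = [[r * c / n for c in col_totals] for r in row_totals]

# Pearson chi-squared statistic (no continuity correction), 1 degree of freedom.
chi2 = sum((observed[i][j] - expected[i][j]) ** 2 / expected[i][j]
           for i in range(2) for j in range(2))

# Sample odds ratio: (no/no * yes/yes) / (no/yes * yes/no).
odds_ratio = (observed[0][0] * observed[1][1]) / (observed[0][1] * observed[1][0])

print(f"expected yes/yes count: {expected[1][1]:.2f}")
print(f"chi-squared statistic:  {chi2:.2f}")
print(f"odds ratio:             {odds_ratio:.2f}")
```

On these numbers the expected yes/yes count is roughly 1.17 against 4 observed. A common rule of thumb treats the chi-squared approximation as unreliable when an expected count is below about 5, so the sparse yes/yes cell matters more here than the large no/no cell.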

Any suggestions on how to deal with this?

Thanks

RNA-Seq ChIP-Seq • 1.6k views

updated 2.9 years ago by Ram 45k • written 9.6 years ago by annerionta93 ▴ 10

score 0 · Answer 1 · 2015-12-08

0

9.6 years ago

gavinmdouglas ▴ 10

I think a chi-squared or Fisher's exact test would be fine for this. You might want to increase the window size to increase the number of windows in the "yes yes" cell, though.
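As a rough sketch of the Fisher's exact test option, the one-sided (enrichment) p-value can be computed from the hypergeometric distribution using only the Python standard library. The function names `log_comb` and `fisher_one_sided` are made up for illustration, the table is the one from the question, and testing in the enrichment direction only is an assumption about what is being asked.

```python
from math import exp, lgamma


def log_comb(n, k):
    """Natural log of the binomial coefficient C(n, k)."""
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)


def fisher_one_sided(table):
    """One-sided Fisher's exact p-value for enrichment of the yes/yes cell.

    table = [[a, b], [c, d]] with d the yes/yes count. Returns P(X >= d)
    where X is hypergeometric with the table's margins held fixed.
    """
    (a, b), (c, d) = table
    total = a + b + c + d
    bound = b + d        # windows with a TF binding event
    regulated = c + d    # windows with a downregulated locus
    log_denom = log_comb(total, regulated)
    p = 0.0
    for k in range(d, min(bound, regulated) + 1):
        log_pmf = (log_comb(bound, k)
                   + log_comb(total - bound, regulated - k)
                   - log_denom)
        p += exp(log_pmf)  # far-tail terms underflow harmlessly to 0.0
    return p


# Table from the question: rows = locus (no, yes), columns = TF binding (no, yes).
table = [[2510067, 1070],
         [2736, 4]]
p_enrichment = fisher_one_sided(table)
print(f"one-sided Fisher p-value: {p_enrichment:.4g}")
```

In practice a library routine such as `scipy.stats.fisher_exact` would normally be used instead; the sketch is only meant to show what the test computes and that it does not rely on large expected counts.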
