Control gene size for gene enrichment study

8.0 years ago

michealsmith ▴ 790

I would like to test whether genes containing at least one binding site for a transcription factor (say MEF2A) are enriched for certain categories.

I can easily build a list of TF-containing genes by intersecting the TF binding-site BED file with a gene annotation BED file, and then submit that list for enrichment analysis.

But the question is: a large gene is naturally more likely to contain TF binding sites. So should I first control for gene size?

Should I normalize by assigning each gene a score, (overlap size)/(gene size), and then sort and select, say, the top 200 or 500 genes?
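To make the proposed score concrete, here is a minimal sketch of (overlap size)/(gene size) followed by a top-N cut. It assumes BED-style half-open coordinates, genes with end > start, and in-memory dictionaries rather than real BED files; the names `overlap_bp` and `rank_genes` are hypothetical, not from any library. Overlapping binding sites are merged so shared bases are counted once.

```python
def overlap_bp(gene, sites):
    """Bases of `gene` covered by `sites` (half-open intervals, same chromosome)."""
    g_start, g_end = gene
    # Clip each overlapping site to the gene boundaries, then sort by start.
    clipped = sorted(
        (max(s, g_start), min(e, g_end))
        for s, e in sites
        if s < g_end and e > g_start
    )
    covered = 0
    cur_end = g_start
    for s, e in clipped:
        s = max(s, cur_end)  # skip bases already counted by an earlier site
        if e > s:
            covered += e - s
            cur_end = e
    return covered


def rank_genes(genes, sites_by_chrom, top_n):
    """Return the top_n (name, score) pairs, score = covered bases / gene length."""
    scores = []
    for name, (chrom, start, end) in genes.items():
        cov = overlap_bp((start, end), sites_by_chrom.get(chrom, []))
        scores.append((name, cov / (end - start)))
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_n]


# Toy data (made up for illustration only).
genes = {"A": ("chr1", 0, 1000), "B": ("chr1", 2000, 2100)}
sites_by_chrom = {"chr1": [(100, 200), (150, 250), (2050, 2080)]}
ranked = rank_genes(genes, sites_by_chrom, top_n=2)
```

In this toy case the short gene B outranks the long gene A even though A has more covered bases, which is the effect the normalization is meant to produce. Whether a top-N cut on this score is statistically appropriate for the enrichment test is a separate question.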

gene enrichment GO • 1.5k views


Is the transcription factor more likely to be biologically relevant when bound to promoters? If so, you could just restrict the overlap to TSS +/- 1 kb, which would generate fragments of equal length.
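A rough sketch of that promoter-window idea, assuming BED-style half-open coordinates, a strand column for each gene, and in-memory dictionaries instead of real files; `tss_window` and `genes_with_promoter_site` are hypothetical helper names. Every gene gets the same 2 kb window (unless clipped at the chromosome start), so gene length no longer affects the chance of a hit.

```python
def tss_window(start, end, strand, flank=1000):
    """Half-open window of +/- `flank` bp around the TSS of a BED-style gene."""
    # On the minus strand the TSS is the last base of the interval (end - 1).
    tss = start if strand == "+" else end - 1
    return max(0, tss - flank), tss + flank


def genes_with_promoter_site(genes, sites_by_chrom, flank=1000):
    """Names of genes whose TSS window overlaps at least one binding site."""
    hits = []
    for name, (chrom, start, end, strand) in genes.items():
        w_start, w_end = tss_window(start, end, strand, flank)
        sites = sites_by_chrom.get(chrom, [])
        if any(s < w_end and e > w_start for s, e in sites):
            hits.append(name)
    return hits


# Toy data (made up for illustration only).
genes = {
    "A": ("chr1", 5000, 9000, "+"),
    "B": ("chr1", 20000, 30000, "-"),
}
sites_by_chrom = {"chr1": [(4500, 4600), (25000, 25100)]}
hits = genes_with_promoter_site(genes, sites_by_chrom)
```

Here gene B is not counted even though a site falls inside its body, because the site is far from its TSS. This only makes sense if promoter-proximal binding is what matters for the factor in question.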

8.0 years ago by jotan ★ 1.3k

No, the TF binds everywhere, and all of those sites are biologically relevant.

8.0 years ago by michealsmith ▴ 790
