I have some problems regarding choosing background sequences for motif analysis let say I have 3 different conditions (DNase I data), I did clustering and I want to see the differences in motif enrichment among the clusters. Each cluster has different number of intervals but for motif analysis I made them all with the same length (200 bp) (summits). my questions are: (1) shall I use the total DNase I peaks (merged from all 3 conditions) as background or genomic regions? which one is better or that makes any difference? (2) in order to do motif search for each cluster, shall I use the same set of background sequences or I should justify it based on CG% and number of regions for each cluster separately?