Question: Motif analysis in repeat-rich ChIP-seq data?
0
gravatar for robbuurstede
12 months ago by
robbuurstede0 wrote:

Hi all,

I’m analyzing ChIP-seq data and currently performing a de novo motif analysis using MEME. The output shows a high number of repeat motifs (e.g. GAGAGAGAGAGA) and only as the 10th motif I find the motif of the TF I chipped. The data is very repeat rich, but repeat masking also results in loss of my TF motif as it is within these repeats. I’m afraid that this will not enable me to identify any other co-occurring motifs of interest, so I was wondering if there is a better approach.

Is there a way to tell MEME not to recognize these simple repeats as motifs? I’ve not found out how to do this just yet.

Do you recommend another tool which is more suitable for repeat-rich sequences?

Thank you very much!

Rob

motif meme chip-seq • 348 views
ADD COMMENTlink modified 12 months ago by simon.vanheeringen190 • written 12 months ago by robbuurstede0

Can't this be biologically meaningful? Which TF is this?

ADD REPLYlink written 12 months ago by ATpoint34k

It sure could be biologically relevant, but I expected the number one motif to be that of the Glucocorticoid Receptor (the chipped TF).

ADD REPLYlink modified 12 months ago • written 12 months ago by robbuurstede0
0
gravatar for ATpoint
12 months ago by
ATpoint34k
Germany
ATpoint34k wrote:

You probably have the peak coordinates and then used something like bedtools getfasta. What you can do is to first identify the genomic coordinates of these repeats, e.g. using any of the solutions from A: Code golf: detecting homopolymers of length N in the (human) genome (modified to match these tandem patterns you encountered) and then use these coordinates to filter out any peaks that intersect with these blacklisted coordinates e.g. using bedtools intersect. Then get sequences from the remaining peaks and re-run the motif search.

ADD COMMENTlink modified 12 months ago • written 12 months ago by ATpoint34k
0
gravatar for Friederike
12 months ago by
Friederike5.6k
United States
Friederike5.6k wrote:

Seems like dust might be a helpful tool for this. You could also browse the excellent MEME suite Q&A page or post your own question there.

ADD COMMENTlink written 12 months ago by Friederike5.6k
0
gravatar for simon.vanheeringen
12 months ago by
simon.vanheeringen190 wrote:

Try GimmeMotifs. It combines different motif prediction tools (including MEME) and compares the identified motifs to a background set of sequences. It usually work very well for ChIP-seq data (disclaimer: I wrote the software).

ADD COMMENTlink written 12 months ago by simon.vanheeringen190
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1170 users visited in the last hour