Question: Questions related to ChromHMM+using MACS2 peak calling output bed file as input for ChromHMM?
0
gravatar for sikhtechai
6 days ago by
sikhtechai30
sikhtechai30 wrote:

Hi all!

I have 3 questions about running the ChromHMM tool to find the combinatorial states of multiple ChIPseq data.

  1. I have replicate ChIPseq data(4 replicates each) for H3K4me3 and H3K27me3 in control vs treatment. For ChromHMM, do I need to merge all the replicates and then run it, or I can run the replicates individually?

  2. As the tutorial suggests, one should start with the bed file coming from the original bam alignment file. But has anyone tried it by using Macs2 peak files? I mean, using MACS2 first to call reliable peaks with a cutoff, and then using ChromHMM to call the combinatorial chromatin states?

  3. Also, as I have two groups, control vs treatment. In each group, I have both H3K4me3 and H3K27me3 ChIPseq data(with 4 replicates). Now, to define bivalent chromatin states, do I give all the control and treatment H3K4me3 and H3K27me3 data in ChromHMM to learn the model or only the control data?

Thank you in advance for your help!

chromhmm macs2 chip-seq • 71 views
ADD COMMENTlink modified 6 days ago by Ar860 • written 6 days ago by sikhtechai30
2
gravatar for Ar
6 days ago by
Ar860
United States
Ar860 wrote:

Here are the answers to your questions:

  1. Don't merge and run all of them together.

  2. One of the first steps of ChromHMM is peak calling. Therefore, you need to use the bed files from bam files and not peak files. Although the default settings (i.e. ) would give you less stringent calls (argument: -p or poissonthreshold in BinarizeBed function). You may change it 1e-5 or less. If you are interested in using MACS peak calls then you may use the argument -peaks within the BinarizeBed function. However, the latter is not recommended for broad peaks marks such as H3K27me3 and H3K9me3.

  3. Yes, you have to give control files for all the treatment file. It is again required for calling the peaks.

Good luck!

ADD COMMENTlink written 6 days ago by Ar860

Thank you very much for your quick answer! It makes sense. However, regarding answer 3, the control file here means, the sample which did not receive the drugs. It is not like, the input control or IgG control for ChIP. Then do you think I should still include them for learning the model? Or, learn the model with only the control group > get the emission states containing both H3K4me3 and H3K27me3, then use those regions(supposedly bivalent regions) to do differential analysis in treatment vs control?

Or do you have any other idea of getting bivalent promoter regions from H3K4me3 and H3K27me3 data? I was thinking at first to just use "bedtools" to intersect the MACS2 peak calling files to get overlapping regions containing H3K4me3 and H3K27me3 and use those genomic regions for differential analysis in my drug treatment vs control samples. Does it make sense?

Thanks a lot for taking the time to help me! Really appreciate it!

ADD REPLYlink written 6 days ago by sikhtechai30
1

It is not like, the input control or IgG control for ChIP. Then do you think I should still include them for learning the model? You can include them but a lot of your called by ChromHMM or MACS would have false positives. Ideally, it should be a whole genome input control or IgG-treated file.

Or, learn the model with only the control group > get the emission states containing both H3K4me3 and H3K27me3, then use those regions(supposedly bivalent regions) to do differential analysis in treatment vs control? No need to have 2 different models. Use only one model having all the files. Otherwise, it would be tough to infer anything using both the models.

If you have only two marks and are interested in bivalent regions then I would not recommend you to run ChromHMM. It is useful when you have more than 3 marks. For bivalent regions, call MACS using each mark and then do the intersection of the peaks using bedtools or bedops. If you want to do differential analysis, then use DIffBind.

ADD REPLYlink written 6 days ago by Ar860

That's a very good advice! Thank you very much for your suggestion!

ADD REPLYlink written 6 days ago by sikhtechai30
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1787 users visited in the last hour