Question

Getting differential ChIP-seq peaks between conditions after calling consensus peaks

0

Entering edit mode

5 months ago

Alexandra • 0

Hello! I have some, pearhaps naive, questions about consensus peaks called using Tao Liu's script on GitHub called merge_then_call_consensus.sh.

Say, for example, I have 4 replicate samples and I manually find the overlapping peaks between them and keep the peaks with overlap in at least 3 of the 4 samples (using something like bedtools intersect). What will be the difference between these and consensus peaks called using the aforementioned script?

Also, say I have found consensus peaks across replicates for a TF ChIP in control versus stimulation conditions. I now want to get differential peaks between control and stimulation with the MAnorm tool. The tool requires that I provide the peak files as well as the read files. Can I use the consensus peak file of each condition with the read file produced by merging all the initial replicate bam files?

Or does it make more sense to merge the bam files for each condition, call peaks and then get differential peaks between control and stimulation?

Thanks in advance!

MAnorm ChIP-seq consensus peaks • 710 views

ADD COMMENT • link 5 months ago by Alexandra • 0

score 3 · Answer 1 · 2024-05-15

3

Entering edit mode

5 months ago

ATpoint 85k

This has formally been investiated by a reputable group here: https://academic.oup.com/nar/article/42/11/e95/1442937

IIRC they think that merging bams and calling peaks on that is "best", given the choices of calling individually + merge, or joint calling.

Tbh, in practice it depends. If I have multiple groups and I need per-group peaks for downstream analysis I often just merge the individual peaksets. Or I call peaks per sample and keep peaks for the consensus that are called in at least x% of samples. It really depends. Maybe that's statistically not pretty, but it always depends on the project.

In the end, I make a count matrix based on the consensus peaks with featureCounts and analyze with DESeq2/edgeR/limma. I don't know MAnorm, but I doubt it's better than these three given how robust and established they are.

ADD COMMENT • link 5 months ago by ATpoint 85k

0

Entering edit mode

Really interesting answer. I had missed that paper and will digest it asap. I usually use DiffBind, but recently have been looking into potential alternatives.

ADD REPLY • link 5 months ago by Ian 6.1k

0

Entering edit mode

That having said, linked paper and its authors developed the csaw package over at Bioconductor that uses a window approach to avoid peak calling altogether, and use the windows for DE analysis.