Hello! I have some, pearhaps naive, questions about consensus peaks called using Tao Liu's script on GitHub called merge_then_call_consensus.sh.
Say, for example, I have 4 replicate samples and I manually find the overlapping peaks between them and keep the peaks with overlap in at least 3 of the 4 samples (using something like bedtools intersect). What will be the difference between these and consensus peaks called using the aforementioned script?
Also, say I have found consensus peaks across replicates for a TF ChIP in control versus stimulation conditions. I now want to get differential peaks between control and stimulation with the MAnorm tool. The tool requires that I provide the peak files as well as the read files. Can I use the consensus peak file of each condition with the read file produced by merging all the initial replicate bam files?
Or does it make more sense to merge the bam files for each condition, call peaks and then get differential peaks between control and stimulation?
Thanks in advance!
Really interesting answer. I had missed that paper and will digest it asap. I usually use DiffBind, but recently have been looking into potential alternatives.
That having said, linked paper and its authors developed the csaw package over at Bioconductor that uses a window approach to avoid peak calling altogether, and use the windows for DE analysis.
Thank you for your response! Interesting paper.