Question: Processing ATAC-seq data after peak calling
gravatar for anais1396
2.4 years ago by
anais139630 wrote:

Hello everyone !!

I've started to work on ATAC-seq data and I would like to know how to process data after peak calling ?

I've two group to study, patients and healthy control, and after performing peak calling to see regions of open chromatin (that correspond to genes potentially expressed), I would like to look for the notably differences between the 2 groups in order to see what's wrong with patients. For instance, I would like to see where some genes are expressed in patients and not in control and vice versa.

Is there a simple way to do those analyses ? Or maybe mutiple ways ? maybe is there a similar analyse from an other NGS technique like ChIP-seq, MNase-seq, DNase-seq, etc... ? What are the tools or pipeline usually used for that ?

Thank you in advance !!


sequencing peak calling • 2.3k views
ADD COMMENTlink modified 2.3 years ago by phosphodiester_bond40 • written 2.4 years ago by anais139630
gravatar for James Ashmore
2.4 years ago by
James Ashmore3.0k
UK/Edinburgh/MRC Centre for Regenerative Medicine
James Ashmore3.0k wrote:

You could look for differentially accessible regions between your treatment and control groups. The DiffBind or csaw packages can help you with this analysis. Once you have the differentially accessible regions you can start looking at genes they are close to or overlap. The GenomicFeatures and GenomicRanges and packages can be used to get a list of gene locations and look for overlaps. You can then try and profile this list of genes using something like a Gene Ontology or Gene Set Enrichment analysis. The goseq package is a good option to perform this analysis.

ADD COMMENTlink written 2.4 years ago by James Ashmore3.0k
gravatar for gildas.lepennetier
2.4 years ago by
gildas.lepennetier10 wrote:

Hi anais1396, What I am doing right now (and I am also quite new to the subject) is the following:

  1. from the reads1 and reads2, I align to a reference genome using bowtie2
  2. conversion from sam to bam
  3. using MACS2 -f BAMPE (for pair-ends ) --broad
  4. annotation using HOMER and

This should give you already a list of genes that are concerned by the open chromatin, according to the closest TSS.

Just leaving that here, because I am quite sure some may want to add more steps to that.

ADD COMMENTlink written 2.4 years ago by gildas.lepennetier10
gravatar for phosphodiester_bond
2.3 years ago by
phosphodiester_bond40 wrote:

Hi Anaïs,

If you'd like to analyze differences in transcription factor activity between the two groups, we developed a tool to do this using your called peaks from ATAC-seq:

More info here:

Good luck!

ADD COMMENTlink written 2.3 years ago by phosphodiester_bond40

Hi phosphodiester_bond,

sounds promising. Good to see that ATAC-seq tools are continuously being published. Can you comment on how your approach is different to the existing chromVAR approach from the Greenleaf lab?

ADD REPLYlink written 2.3 years ago by ATpoint41k


I would need to look at the documentation carefully, but at first pass it seems these are two different tools and chromVAR is more focused on comparing the ATAC-seq signal itself between experiments, rather than the estimated levels of TF activity between the two datasets. It looks like it provides functions to figure out what motifs are overlapped by a particular peak (fixed to Jaspar, while DAStk can be used with scanned motif sites from any sources), but not the comparison experimet-wise of what are the most significant changes in TF activity. Thanks for pointing this out, though, because I haven't heard of it and may be handy in other scenarios!

(sorry about the late response, I need to setup notifications)

ADD REPLYlink written 2.2 years ago by phosphodiester_bond40
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1380 users visited in the last hour