Question: Differential Open Chromatin Locations From Dnase-Seq Data
gravatar for Shumon Shumon
7.9 years ago by
Shumon Shumon10 wrote:

I have DNase-seq data (two SAM files or two BED files) in two different cell lines (no control data in any of the cell line) from ENCODE. I would like to have enriched/significantly differentially open chromatin location. In short comparison between accessible regions between: cell-A/cell-B.

Is there any tool that would do that for me? Suggested/recommended command-line option settings for that tool would also help.


next-gen • 4.0k views
ADD COMMENTlink modified 7.5 years ago by spacemorrissey240 • written 7.9 years ago by Shumon Shumon10
gravatar for Fidel
7.9 years ago by
Fidel1.9k wrote:

What you can do is to use a peak caller between the two files. You will do this twice, once to find enriched regions in one file that are not in the other and viceversa. MACS is one of the tools that will do the job. You just need to convert your SAM files to BAM.

Normally, peak callers try to find the enriched genomic regions of a sample, but since the distrubution of reads is not homogeneous througout the genome they use a so called input file to distinguish spurious enrichments from real enrichments. In other words, peak callers assess enrichments given a background signal. However, you can replace this background signal for comparison with another comparable sample (based in the same genome assembly) that is not an input. In that case the peak caller will identify enrichments that are statistically significant in one of the samples with respect to the other sample.

ADD COMMENTlink written 7.9 years ago by Fidel1.9k
gravatar for spacemorrissey
7.5 years ago by
United States
spacemorrissey240 wrote:

MACS is not really an appropriate tool for calling DNase peaks. I use F-seq. If this is ENCODE data, there should be peaks or hotspots determined by the lab that generated the data available on the browser. You can just download the peak sets and do intersections on Galaxy.

ADD COMMENTlink written 7.5 years ago by spacemorrissey240

This is a very delayed followup, but I just found this: "MACS and F-SEQ are considered among the best reported peak callers for the DNaseI-Seq" at

Posting it now in case anyone else is researching DNase-Seq peak callers.

ADD REPLYlink written 4.1 years ago by igor11k

I second this - I have just completed MACS2 peak calling on ATAC-seq data (similar to DNase-seq) with good results

ADD REPLYlink written 3.5 years ago by YaGalbi1.5k
gravatar for Ying W
7.9 years ago by
Ying W4.0k
South San Francisco, CA
Ying W4.0k wrote:

I would recommend loading the regions of interest (open regions from peak caller in BED format) into R as GRanges objects then doing and overlap and seeing which reads are unique for each condition. This is of course assuming you know R. I do not know of any tools that will give you this data without some programming.

ADD COMMENTlink modified 7.9 years ago • written 7.9 years ago by Ying W4.0k
gravatar for dnaseiseq
7.9 years ago by
United Kingdom
dnaseiseq210 wrote:

maybe this article can be interesting for you:

Madrigal P and Krajewski P (2012) Current bioinformatic approaches to identify DNase I hypersensitive sites and genomic footprints from DNase-seq data. Front. Gene. 3:230. doi: 10.3389/fgene.2012.00230

ADD COMMENTlink written 7.9 years ago by dnaseiseq210
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2186 users visited in the last hour