Question: Differential Open Chromatin Locations From Dnase-Seq Data
1
gravatar for Shumon Shumon
6.4 years ago by
Shumon Shumon10 wrote:

I have DNase-seq data (two SAM files or two BED files) in two different cell lines (no control data in any of the cell line) from ENCODE. I would like to have enriched/significantly differentially open chromatin location. In short comparison between accessible regions between: cell-A/cell-B.

Is there any tool that would do that for me? Suggested/recommended command-line option settings for that tool would also help.

thanks

next-gen • 3.6k views
ADD COMMENTlink modified 6.0 years ago by spacemorrissey200 • written 6.4 years ago by Shumon Shumon10
2
gravatar for Fidel
6.4 years ago by
Fidel1.9k
Germany
Fidel1.9k wrote:

What you can do is to use a peak caller between the two files. You will do this twice, once to find enriched regions in one file that are not in the other and viceversa. MACS is one of the tools that will do the job. You just need to convert your SAM files to BAM.

Normally, peak callers try to find the enriched genomic regions of a sample, but since the distrubution of reads is not homogeneous througout the genome they use a so called input file to distinguish spurious enrichments from real enrichments. In other words, peak callers assess enrichments given a background signal. However, you can replace this background signal for comparison with another comparable sample (based in the same genome assembly) that is not an input. In that case the peak caller will identify enrichments that are statistically significant in one of the samples with respect to the other sample.

ADD COMMENTlink written 6.4 years ago by Fidel1.9k
2
gravatar for spacemorrissey
6.0 years ago by
United States
spacemorrissey200 wrote:

MACS is not really an appropriate tool for calling DNase peaks. I use F-seq. If this is ENCODE data, there should be peaks or hotspots determined by the lab that generated the data available on the browser. You can just download the peak sets and do intersections on Galaxy.

ADD COMMENTlink written 6.0 years ago by spacemorrissey200
1

This is a very delayed followup, but I just found this: "MACS and F-SEQ are considered among the best reported peak callers for the DNaseI-Seq" at http://chipster.csc.fi/material/chipseq/tutorials/hashem/DNase-seqIntroduction.pdf

Posting it now in case anyone else is researching DNase-Seq peak callers.

ADD REPLYlink written 2.5 years ago by igor7.4k

I second this - I have just completed MACS2 peak calling on ATAC-seq data (similar to DNase-seq) with good results

ADD REPLYlink written 2.0 years ago by YaGalbi1.4k
0
gravatar for Ying W
6.4 years ago by
Ying W3.9k
South San Francisco, CA
Ying W3.9k wrote:

I would recommend loading the regions of interest (open regions from peak caller in BED format) into R as GRanges objects then doing and overlap and seeing which reads are unique for each condition. This is of course assuming you know R. I do not know of any tools that will give you this data without some programming.

ADD COMMENTlink modified 6.4 years ago • written 6.4 years ago by Ying W3.9k
0
gravatar for dnaseiseq
6.4 years ago by
dnaseiseq200
United Kingdom
dnaseiseq200 wrote:

maybe this article can be interesting for you: http://www.ncbi.nlm.nih.gov/pubmed/23118738

Madrigal P and Krajewski P (2012) Current bioinformatic approaches to identify DNase I hypersensitive sites and genomic footprints from DNase-seq data. Front. Gene. 3:230. doi: 10.3389/fgene.2012.00230

ADD COMMENTlink written 6.4 years ago by dnaseiseq200
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2264 users visited in the last hour