Question: Where To Find Encode Dnase I Footprint Data
gravatar for joshualevipayne
5.6 years ago by
joshualevipayne70 wrote:


I would like to download the ENCODE DNase I footprint data for the 41 human cell and tissue types described by Neph et al. (2012) Nature, 489:83-90.

Here is an example to illustrate my question. Consider only a single human cell type, K562. The following DNase I data files can be found via NCBI's GEO repository, using accession number GSE26328 (see







Upon downloading these files, it becomes apparent that none of them contain the actual footprints. Perhaps the most relevant file is GSM646567_hg19_wgEncodeUwDgfK562Hotspots.broadPeak.txt.gz, which contains 256,735 lines, each pertaining to a DNase I Hypersensitive Zone (see Indeed, this number matches up exactly with the number of "Hotspot Regions" reported in Table S1 of the Neph et al. (2012) paper.

What I am looking for, however, is a .bed file that lists the coordinates of the footprints that these Hotspot Regions contain. Based on Table S1, this file should have 498,683 lines (see "Number of Footprints" column).

Does anyone know where to find these files? Moreover, does anyone know where to find the Footprint Occupancy Scores for these 41 cell and tissue types?

encode • 2.9k views
ADD COMMENTlink modified 5.6 years ago by sjneph600 • written 5.6 years ago by joshualevipayne70
gravatar for sjneph
5.6 years ago by
sjneph600 wrote:

These are available through ftp at EBI.

As the README explains, all 41 cell type footprints are available in one file, and the FOS is shown in column 5 (lower score is a more significant score).

ADD COMMENTlink modified 5.6 years ago • written 5.6 years ago by sjneph600

Direct from the source! Thank you sjneph, much appreciated.

ADD REPLYlink written 5.6 years ago by joshualevipayne70
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 798 users visited in the last hour