Question

Do specialized Hi-C databases exist?

5

8.7 years ago

svet.sidorov ▴ 110

Dear colleagues,

Does anybody know whether there are specialized databases for raw Hi-C data (paired-end reads)? Of course, there are many raw Hi-C datasets on GEO, but I failed to find any specialized Hi-C database on the Web.

database Hi-C • 6.3k views

ADD COMMENT • link updated 19 months ago by Ram 43k • written 8.7 years ago by svet.sidorov ▴ 110

0

Are you only interested in Hi-C, or also in other chromosomal looping datasets such as 4C/5C and ChIA-PET?

ADD REPLY • link 8.7 years ago by Ying W ★ 4.2k

0

I'm interested predominantly in Hi-C, but other 'C' datasets may also be interesting.

ADD REPLY • link 8.7 years ago by svet.sidorov ▴ 110

6

7.3 years ago

Fidel ★ 2.0k

We have been working on an initiative to host and visualize Hi-C data. We download the data from SRA and process it with our pipeline to generate downloadable matrix files (stored as HDF5 files). So far we have some data available for Drosophila, mouse and human.

The website is:

http://chorogenome.ie-freiburg.mpg.de

Feedback and support are welcome.

A screenshot for the human GM12878 cell line looks like this:

[Screenshot: Hi-C data for the human GM12878 cell line]

ADD COMMENT • link 7.3 years ago by Fidel ★ 2.0k

1

Fidel, this is amazing. A much-needed and much-appreciated initiative.

ADD REPLY • link 7.3 years ago by Ryan Dale 5.0k

3

7.7 years ago

Azamat Gafurov ▴ 30

There is the LOGIQA (Long-range genome interactions quality assessment) database. However, it hosts only quality scores (assessed over Hi-C, 4C, ChIA-PET, Capture-C, etc.). Nevertheless, it is very convenient for finding the data you are looking for.

ADD COMMENT • link 7.7 years ago by Azamat Gafurov ▴ 30

0

The above-mentioned link is not working.

http://ngs-qc.org/logiqa/index.php

ADD REPLY • link 7.7 years ago by EagleEye 7.5k

0

8.6 years ago

Bryan Lajoie ▴ 20

I don't believe there are any specialized Hi-C databases on the web...

(perhaps that will change with ENCODE3 + 4D Nucleome efforts)

Ryan is correct, though: I doubt anyone would duplicate what is already stored on GEO/SRA. Most paper submissions (GEO) include not only the FASTQ files but also some form of the interaction data. The data will be in a TSV (3-column) format, a TSV (matrix) format, or HDF5.
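
As a rough illustration of the 3-column form, here is a minimal Python sketch that turns such rows into a dense matrix. It assumes the three columns are `bin_i`, `bin_j` and `count`, with integer bin indices and one contact per line. That layout is an assumption for illustration only, since the actual columns vary between GEO submissions, and the function names are hypothetical.

```python
from collections import defaultdict


def read_sparse_tsv(lines):
    """Parse 'bin_i<TAB>bin_j<TAB>count' rows (assumed layout) into a symmetric lookup."""
    contacts = defaultdict(dict)
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment/header lines
        bin_i, bin_j, count = line.split("\t")
        i, j, value = int(bin_i), int(bin_j), float(count)
        # Hi-C contact maps are symmetric, so store both orientations.
        contacts[i][j] = value
        contacts[j][i] = value
    return contacts


def to_dense(contacts, n_bins):
    """Expand the sparse lookup into an n_bins x n_bins list of lists (0.0 where no contact)."""
    return [
        [contacts.get(i, {}).get(j, 0.0) for j in range(n_bins)]
        for i in range(n_bins)
    ]


# Toy rows, not real data.
example = ["0\t0\t12", "0\t1\t5", "1\t2\t3"]
matrix = to_dense(read_sparse_tsv(example), n_bins=3)
```

For a real file you would pass an open file handle instead of the toy list, and check the submission's README for the actual column meaning first.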

I think GEO will be your best bet for the time being.

ADD COMMENT • link updated 4.4 years ago by Ram 43k • written 8.6 years ago by Bryan Lajoie ▴ 20

0

7.0 years ago

encoder • 0

This website might be helpful for searching the available Hi-C datasets: http://promoter.bx.psu.edu/hi-c/view.php

ADD COMMENT • link 7.0 years ago by encoder • 0

Ram · Accepted Answer · 2015-08-28

3

8.7 years ago

Ryan Dale 5.0k

I wouldn't expect anyone to duplicate storage of raw reads beyond what SRA and GEO do. But as far as aggregating multiple experiments goes, the WashU Epigenome Browser has long-range chromatin interaction experiment tracks available. Currently there are 73 tracks available for human, and 27 each for mouse and fly.

ADD COMMENT • link updated 19 months ago by Ram 43k • written 8.7 years ago by Ryan Dale 5.0k

0

Ryan, thank you for the information!

ADD REPLY • link 8.7 years ago by svet.sidorov ▴ 110