"Good" epigenetic histone modification ChIP-seq datasets
4
2
Entering edit mode
8.2 years ago

Could anyone recommend any "good" publicly available (e.g., from the SRA/GEO) ChIP-seq datasets with broad histone marks? In general, I'd like to use these data for testing out novel (and existing) ChIP-seq methodologies for benchmarking purposes.

I'm curious to see if the community considers any specific existing epigenetic histone modification ChIP-seq datasets to be particularly "clean" (e.g., high SNR ratio of peaks relative to noise in gene deserts) that would be particularly suitable as training data, for example, for development and testing of new algorithms. Note I am not interested in transcription-factor ChIP-seq data, only histone modification ChIP-seq data (it does not matter what the specific histone mark under study is, as long as the data is suitable for the purposes outlined above).

Relevant post: Chip-Seq Datasets For Cancer Genome Samples?

ChIP-Seq • 2.3k views
ADD COMMENT
0
Entering edit mode

What about the ENCODE data?

ADD REPLY
4
Entering edit mode
8.2 years ago
Emily 23k

ENCODE, Roadmap Epigenomics and Blueprint. You'll find a few thousand ChIP-seq experiments between those three.

ADD COMMENT
3
Entering edit mode
8.2 years ago
cbio ▴ 450

Here's a dataset generated by my lab for a paper. The Chip-Seq data is solid, hopefully it'll be helpful for what you're trying to accomplish.

ADD COMMENT
2
Entering edit mode
8.2 years ago
John 13k

IHECis the next-generation international Epigeneomic sequencing initiative, although not all the data has been sequenced/released yet.

Personally, I would look at getting some of Germany's contribution to IHEC (data from DEEP) as one of our goals is to produce/publish extremely deep sequencing for histone modifications, RNA, WGBS, DNAse, etc. For a training dataset I don't think you can get much better :)

ADD COMMENT
0
Entering edit mode
8.2 years ago
jotan ★ 1.3k

All the databases suggested above are good places to download.

The commonly studies histone modifications tend to have the cleanest signals.

The H3K4me3 antibodies are very good and almost all the K4me3 ChIP-seq datasets are clean and crystal clear.

H3K36me3 is also tends to have yield good ChIP-seq data but the read counts must be sufficiently high. Any recently generated data should be suitable.

The Jenuwein group has also published some decent H3K9me3 ChIP-seq in mouse cells

ADD COMMENT

Login before adding your answer.

Traffic: 1892 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6