A General Evaluation Of Chipseq Data
3
7
Entering edit mode
12.1 years ago
Hamilton ▴ 290

is there any measure to validate if the one chipseq experiment is good or not?? that is, the data are good enough to analyze or the second experiment should be redone as the 1st experiment was not good? how can we measure such a quality usually? sort of quality measure to account for non-redundant reads for peak regions at a given sequencing depth.(for say, 30M reads in the experiment)?, namely, how many non-redundant reads are mapped in peak regions from the experiment? Otherwise, any other criteria to consider??

basically, how do we evaluate a chipseq experiment quality? Quality scores from galaxy using fastq files are good enough? any other statistical measures?

Any comments are welcome.

chip-seq quality • 5.2k views
ADD COMMENT
10
Entering edit mode
12.1 years ago

The ENCODE and Roadmap projects have adopted a number of QC criteria for ChIP-SEQ data:

  • Proportions of aligned and duplicate reads
  • Peak enrichment
  • Peak 'Lag' between strands
  • Reproducibility between replicates

See the documents below for more detail.

http://genome.ucsc.edu/ENCODE/protocols/dataStandards/ChIP_DNase_FAIRE_DNAme_v2_2011.pdf

http://www.roadmapepigenomics.org/files/protocols/data/histone-modification/REMC_ChIP-seqStandardsFINAL.pdf

ADD COMMENT
4
Entering edit mode
12.1 years ago

Usually one would look whether there is significant enrichment of reads in the ChIP sample compared to the control (eg. input DNA or nonspecific antibody) in specific regions. Perhaps some regions of enrichment are known in advance; if not, some peaks you are seeing in the data could be checked with ChIP-qPCR.

Computationally, if you know the motif for the TF you are doing ChIP on (if it is a TF) you can check whether the motif is significantly enriched in the peaks you get.

On a more basic level, I would find it worrying if I did peak calling for both ChIP over control and (reversed) control over ChIP, and got more peaks for control over ChIP.

ADD COMMENT
2
Entering edit mode
12.1 years ago
Ian 6.0k

Just a couple of tools i have found useful to answer this question:

HOMER (Perl based)

htSeqTools (R based; i particularly like the PCA represenation of coverage similarity between the samples)

ADD COMMENT
0
Entering edit mode

It'd be more helpful if you could briefly say why one would want to use the tool/what to look for in the output.

EDIT: I now realise this post is three years old...

ADD REPLY

Login before adding your answer.

Traffic: 2749 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6