Question: Comparing peaks between multiple samples
0
gravatar for emilyprice16
5 weeks ago by
emilyprice160 wrote:

I have a set of ChIP-seq peaks for 4 samples. I want to create one file in which the intersected genomic peaks are listed along with how many out of the 4 samples it appeared. I would also like the genomic region of the intersect over multiple samples to have the lowest start and highest end coordinates across the samples. E.g Sample A - Chr 1 349-678 Sample B - Chr 1 328-669 Sample C - Chr 1 330-671 Sample D - Chr 1 351-677

I would like it come back as Chr 1 328-678 and it was in 4 samples. I think the bedtools multiinter is something I can use but there doesn’t seem to be much information on it and also how to do it according to what I want. Can anyone help in what the best way going about this is?

chip-seq bedtools • 146 views
ADD COMMENTlink modified 27 days ago by Rory Stark790 • written 5 weeks ago by emilyprice160
0
gravatar for Rory Stark
4 weeks ago by
Rory Stark790
University of Cambridge, Cancer Research UK - Cambridge Institute
Rory Stark790 wrote:

The Bioconductor tool DiffBind does exactly this, merging peaks and maintaining how many (and which) peaksets they were called in.

ADD COMMENTlink written 4 weeks ago by Rory Stark790

That is just what I am after! I have looked through the DiffBind reference manual and can follow how to do this. I have another question after reading through it though - in section 6.3 it demonstrates how you can identify sites that are unique to a sample group. In that example resistant vs responsive in the tamoxifen dataset and you can produce Venn diagrams which nicely show the overlap and the unique peak numbers. I have ChIP-seq results for two transcription factors at different conditions that were performed using the same samples. I can see from the tamoxifen example that I will be able to use DiffBind to look at the difference between the two conditions for each of the transcription factors individually. However can I look at the the overlap between the two transcription factors at the same condition in a certain % of samples? If so could I do this by uploading a sample sheet that contains the samples for the two transcription factors at the one condition. Instead of the other way round in which the sample sheet would be for one TF containing the samples of the two different conditions.

ADD REPLYlink modified 4 weeks ago • written 4 weeks ago by emilyprice160
0
gravatar for Rory Stark
27 days ago by
Rory Stark790
University of Cambridge, Cancer Research UK - Cambridge Institute
Rory Stark790 wrote:

Yes you can do this. You can divide things up using any combination of meta-factors (Tissue, Factor, Condition, Treatment, Replicate), each of which can contain anything you like, or by calling out any subsets of peak using masks.

Cheers-

Rory

ADD COMMENTlink written 27 days ago by Rory Stark790
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1546 users visited in the last hour