Question: Plotting ChIP signal at TSSs with deepTools
6 months ago, srhic wrote:


I have a very basic question about using the deepTools computeMatrix function to plot ChIP/ATAC-seq signal across TSSs. I am curious what the 'standard' way is to make the BED file of the TSSs I want to plot. I can make the BED file in two ways:

  1. I get the gene start coordinate of my genes of interest from an online database like Ensembl and make the BED file. This gives me one start site per gene.

  2. I get all Ensembl-annotated TSSs associated with a gene, which gives me multiple regions per gene. This seems like the better option since I am not ignoring any TSS, but I am concerned that, since most TSSs are pretty close to each other, I will be double counting most of them.
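The two options can be sketched in Python as follows. This is only an illustration, assuming the annotation is an Ensembl-style GTF with `gene` and `transcript` feature rows and `gene_id` / `transcript_id` attributes; the function name `tss_bed_records` and the toy GTF lines are made up for the example and are not part of deepTools.

```python
import re


def tss_bed_records(gtf_lines, feature="transcript"):
    """Yield one BED6 tuple (chrom, start, end, name, score, strand) per feature.

    feature="gene" gives one TSS per gene (option 1);
    feature="transcript" gives one TSS per annotated transcript (option 2).
    """
    id_key = "gene_id" if feature == "gene" else "transcript_id"
    for line in gtf_lines:
        if line.startswith("#"):
            continue  # skip GTF header comments
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != feature:
            continue
        chrom, strand = fields[0], fields[6]
        start, end = int(fields[3]), int(fields[4])
        match = re.search(id_key + r' "([^"]+)"', fields[8])
        name = match.group(1) if match else "."
        # The TSS is the feature start on "+" and the feature end on "-".
        tss = start if strand == "+" else end
        # GTF is 1-based inclusive; BED is 0-based half-open.
        yield (chrom, tss - 1, tss, name, 0, strand)


# Toy annotation (hypothetical IDs and coordinates).
example_gtf = [
    '1\thavana\tgene\t1000\t5000\t.\t+\t.\tgene_id "G1";',
    '1\thavana\ttranscript\t1000\t5000\t.\t+\t.\tgene_id "G1"; transcript_id "T1";',
    '1\thavana\ttranscript\t1200\t5000\t.\t+\t.\tgene_id "G1"; transcript_id "T2";',
    '2\thavana\tgene\t300\t900\t.\t-\t.\tgene_id "G2";',
    '2\thavana\ttranscript\t300\t900\t.\t-\t.\tgene_id "G2"; transcript_id "T3";',
]

per_gene = list(tss_bed_records(example_gtf, feature="gene"))
per_transcript = list(tss_bed_records(example_gtf, feature="transcript"))

for record in per_transcript:
    print("\t".join(str(value) for value in record))
```

With the toy input, option 1 yields two regions (one per gene) and option 2 yields three (one per transcript), which is the duplication the question is asking about.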

Since such plots are very common in the literature, I was curious what the best practice is for making them.

6 months ago, Devon Ryan (Freiburg, Germany) wrote:

Within the deepTools team we use the second option you listed. Yes, this results in some duplication, but since the duplicate regions are ambiguous in terms of which transcript they're likely to affect, this ends up maintaining more biological meaning.
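For reference, a per-transcript TSS BED file would then be passed to computeMatrix in reference-point mode. The sketch below only assembles the command lines; the file names (`tss_per_transcript.bed`, `chip.bw`, `matrix.gz`, `tss_profile.png`) and the 3 kb flanks are placeholder assumptions, not values from this thread.

```python
import shlex


def compute_matrix_command(bed_file, bigwig_file, matrix_file, flank=3000):
    """Build a computeMatrix call that centers each region on its TSS."""
    return [
        "computeMatrix", "reference-point",
        "--referencePoint", "TSS",
        "--regionsFileName", bed_file,
        "--scoreFileName", bigwig_file,
        "--beforeRegionStartLength", str(flank),
        "--afterRegionStartLength", str(flank),
        "--outFileName", matrix_file,
    ]


def plot_profile_command(matrix_file, image_file):
    """Build a plotProfile call that plots the average signal from the matrix."""
    return ["plotProfile", "--matrixFile", matrix_file, "--outFileName", image_file]


# Placeholder file names, assumed for illustration only.
matrix_cmd = compute_matrix_command("tss_per_transcript.bed", "chip.bw", "matrix.gz")
profile_cmd = plot_profile_command("matrix.gz", "tss_profile.png")

print(shlex.join(matrix_cmd))
print(shlex.join(profile_cmd))
```

To actually run the commands, each list can be handed to `subprocess.run(cmd, check=True)` on a machine where deepTools is installed.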


Thanks. I found an old thread mentioning the same but also recommending that overlapping regions in the BED file be merged before plotting. Do you think that is a good idea?

Reply written 6 months ago by srhic

There's no need to merge the regions, and doing so is rarely a good idea.

Reply written 5 months ago by Devon Ryan