Question: How can I see the distribution of TFBS on specific enhancer in UCSC
0
gravatar for z17645797947
14 days ago by
z176457979470 wrote:

I have the location information of TFBS and enhancer such as: chr1:851083-852146, and I would like to know the distribution of known TFBS on a specific enhancer. Whether UCSC Genome Browser can help me to identify it?

Can anyone help me?

ucsc tfbs gene • 190 views
ADD COMMENTlink modified 13 days ago by simon.vanheeringen110 • written 14 days ago by z176457979470

Do you want the distribution or just to know which are enriched for your enhancer regions? If it's the latter, you can try AME, which is made for just that.

ADD REPLYlink written 13 days ago by jared.andrews07320
2
gravatar for Alex Reynolds
14 days ago by
Alex Reynolds21k
Seattle, WA USA
Alex Reynolds21k wrote:

The UCSC Genome Browser does not offer a TFBS track, to my knowledge.

You could use FIMO to call TFBS on your reference genome, against a database like JASPAR. Or TRANSFAC, if you have a license.

Once you have TFBS in BED format, you could use BEDOPS bedmap to count or call TF hits from your FIMO scan that associate (overlap with) your regions of interest, such as your enhancer, or any number of other sites in BED format.

ADD COMMENTlink written 14 days ago by Alex Reynolds21k

Do you have an estimate on how long FIMO takes for hg38?

ADD REPLYlink written 13 days ago by ATPoint2.5k

Depends how many PWMs you want it to look for. If you use a full database, like all of the JASPAR CORE, it takes a pretty long time, but you can cut it down by breaking the genome up into chromosomes and using GNU Parallel to run it on them in parallel, then just concatenate the results. The resulting file will be very large though, so be prepared.

If you only have a few PWMs of interest, it's pretty quick. I think it remember it taking 2-3 hours, and that was without breaking up things by chromosome. Really made for scanning specific regions (like putative regulatory elements), rather than the entire genome, I think.

ADD REPLYlink written 13 days ago by jared.andrews07320
1

If you use a lower threshold (1e-5) a whole-genome scan is tractable and easy to do repeat set operations on with bedmap etc. Can be compressed with starch to make once and read/map many times without taking too much disk space.

ADD REPLYlink modified 13 days ago • written 13 days ago by Alex Reynolds21k

Thank you,This helps a lot!

ADD REPLYlink written 10 days ago by z176457979470
1
gravatar for simon.vanheeringen
13 days ago by
simon.vanheeringen110 wrote:

You can check the ReMap UCSC track (based on ChIP-seq data):

http://tagc.univ-mrs.fr/remap/index.php?page=browse

And JASPAR 2018 has a UCSC track for predicted binding sites:

http://jaspar.genereg.net/genome-tracks/

ADD COMMENTlink written 13 days ago by simon.vanheeringen110
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 589 users visited in the last hour