Question

Best Database Of Transcription Factor Binding Sites

9

Entering edit mode

13.1 years ago

Dataminer ★ 2.8k

(a) Which is the best database for transcription factor binding site, TRANSFAC or JASPAR? Why?

(b) Is it still valid to assume, "transcription factor binding sites are only present in 5'UTR and in promoter region"?

Note: kindly put references in your answer.

transcription binding database • 17k views

ADD COMMENT • link updated 15 months ago by nta • 0 • written 13.1 years ago by Dataminer ★ 2.8k

0

Entering edit mode

I think it is pretty optimistic to think that there is a reference to document that one specific database of TF binding sites is better than another. How would you even define "best" in this case?

ADD REPLY • link 13.1 years ago by Lars Juhl Jensen 11k

0

Entering edit mode

Changed title to be more descriptive

ADD REPLY • link 13.1 years ago by Lars Juhl Jensen 11k

0

Entering edit mode

True, I agree, but what should be the preference while selecting one database and not the other.

ADD REPLY • link 13.1 years ago by Dataminer ★ 2.8k

Ram · Answer 1 · 2011-03-16

defining 'best' is a tricky task, both JASPAR and TRANSFAC are good, though one advantage of JASPAR is that it is open vs. TRANSFAC which is not. The free version of TRANSFAC is quite behind the times.
No. TFBSs can occur far away from the TSS as on enhancers http://en.wikipedia.org/wiki/Enhancer_(genetics) or even in introns (see, http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=05479005)

score 9 · Answer 2 · 2012-02-17

This very much depends on what you are trying to accomplish.

TRANSFAC and JASPAR are databases of transcription factor binding models. Using experimental techniques like SELEX and now increasingly ChIP-Seq we can identify the kinds of sequences that a particular transcription factor (TF) has affinity for. This affinity is typically represented with something like a position weight matrix (PWM) or position-specific scoring matrix (PSSM). Those models can be used to scan a genome for "putative TF binding sites" that have a high similarity to the model. Whether those sites are actually bound by a TF and play a functional role in transcription and under what conditions must still be determined experimentally be traditional molecular techniques like promoter bashing, reporter gene assays, ChIP experiments, etc.

Databases like ORegAnno and PAZAR have attempted to collect and curate individual experimentally proven TFBSs from the literature and sets from ChIP-Seq and other genome-wide experiments. Technologies like ChIP-Seq can give us a genome-wide profile of all the binding sites for a particular transcription factor under a specific condition. Many of these TF binding sites (TFBSs) are being deposited into species-specific databases (FlyBase, SGD, etc) and as tracks in the UCSC browser. In particular, see efforts by ENCODE. A good starting point would be to look through the tracks under 'Regulation' in the UCSC Genome Browser for your species of interest.

score 3 · Answer 3 · 2011-03-16

I just happened to ask how JASPAR compares to Transfac to members of the JASPAR team present at a meeting in Montpellier, I think about 2 years ago. The answer I got was "Transfac (the Pro version) covers more, but our quality is better".

If you mention Transfac Pro I think you should also mention Genomatix. They stem from the same basic Transfac development in Germany. There were two commercial lines started from this project and they both developed upon the original open Transfac. They were in fact early endeavors to see whether government funded investments in databases could be made sustainable through offering commercial services.

Academic users can use most of parts of the Genomatix toolset and database with a free account. Access to (part of?) the commercial content of the Transfac Pro is possible through websites that use it behind the scenes.

score 3 · Answer 4 · 2012-02-18

You might also want to consider the UniPROBE database which contains PSSM models based on protein-binding microarrays (PBMs).

If you are doing your analysis in yeast, you should consider the YeTFaSCo database which is a curated collection of PSSM models for most of the yeast TFs.

Hundreds of transcription factor (TF) binding preferences have already been queried using PBMs, soon thousands will be. JASPAR and TRANSFAC are a bit slow in keeping up with these new data, it may be worth it to keep an eye on the work of Tim Hughes and Martha Bulyk who are the ones actively generating TF binding preference data using PBMs.

Ram · Answer 5 · 2014-07-28

1

Entering edit mode

9.7 years ago

Maximilian Haeussler ★ 1.6k

Just to show that the free version 3.2 of transfac is really old: Version 3.2 is from 1997.

ftp://ftp.ebi.edu.au/pub/databases/transfac/tfdoc32.txt

ADD COMMENT • link updated 2.4 years ago by Ram 43k • written 9.7 years ago by Maximilian Haeussler ★ 1.6k

score 1 · Answer 6 · 2016-04-21

1

Entering edit mode

8.0 years ago

ivan.kulakovskiy ▴ 10

For PSSMs/PWMs of human/mouse TFs you may also try HOCOMOCO (http://hocomoco.autosome.ru).

ADD COMMENT • link 8.0 years ago by ivan.kulakovskiy ▴ 10

score 0 · Answer 7 · 2016-10-20

0

Entering edit mode

7.5 years ago

jin ▴ 80

You can access the TF binding sites in Plants at PlantRegMap, which are derived from ChIP-seq and DNase I genomic footprinting.

ADD COMMENT • link 7.5 years ago by jin ▴ 80

score 0 · Answer 8 · 2023-01-26

Genexplain recently uploaded a white paper explaining the difference between TRANSFAC and JASPAR. It seems when we use TRANSFAC there are chances that we don't miss important matrices that play a role in regulation of gene of interest. The white paper link can be found here:

https://genexplain.com/wp-content/uploads/2023/01/TRANSFAC-revealed-IRF3-site-in-SNPs-associated-with-high-severity-of-Covid-19-missed-by-JASPAR.pdf

Hopefully this is still helpful to someone.