Question

How To Define A Protein As Kinase Protein?

1

Entering edit mode

10.6 years ago

Nims ▴ 90

I would like to know that how can we precisely identify if a protein is kinase protein or not based on some gene ontology (GO) terms. Like EGFR in UniProt has some GO terms.

protein • 3.6k views

ADD COMMENT • link updated 2.4 years ago by Ram 43k • written 10.6 years ago by Nims ▴ 90

1

Entering edit mode

You will have to include more info. What "kind" of protein do you have? Accession numbers, unannotated sequences? Model-organisms? Non modal-organisms?

ADD REPLY • link 10.6 years ago by David Westergaard ★ 1.5k

0

Entering edit mode

Found the solution - UniProt has keyword annotations and some of these are also for kinase.

ADD REPLY • link 10.6 years ago by Nims ▴ 90

score 3 · Answer 1 · 2013-08-27

A kinase chemically phosphorylates others proteins. You could precisely identify that your protein is a kinase by determining if your protein phosphorylates another protein.

You're asking how to do this bioinformatically (which would not be definitive), but I would look at the protein composition (amino acid sequence), the estimated folding structure, and any additional tertiary indications and see how homologous these characters are in relation to other kinase proteins in databases, such as GO. Does your protein have motifs in common with other proteins that have been fully characterized to be kinases?

Ram · Answer 2 · 2014-08-02

You shouldn't use GO terms to define protein kinases. There are more sophisticated ways to make these predictions, such as sequence scanning, on a domain level. A solution would be to use HMMER, combined with the kinomer library (this is similar to what PFAM does), except it will predict on a kinase family level.

Download and install HMMER from http://hmmer.janelia.org/
Download Kinomer HMM Libraries from http://www.compbio.dundee.ac.uk/kinomer/download.html
Run hmmer using the downloaded kinomer library:

hmmsearch --domtblout predictions.txt allPK.hmm seqs.fasta

Where seqs.fasta is a file containing the protein sequences you want to scan in fasta format, allPK.hmm is the HMM library downloaded from kinomer, and predictions.txt is the file containing the resulting output.

For more help on hmmer:

hmmsearch -h

Alternatively, you could use the web interface provided by kinomer: http://www.compbio.dundee.ac.uk/kinomer/bin/runHMMer.pl ?

Good luck

score 1 · Answer 3 · 2014-08-02

If you want a list of "known" kinases according to various authorities you can use the 'Browse Categories' function of DGIdb and navigate to the generic kinase category, or more specifically to Tyrosine, Serine Threonine, PI3, or Lipid kinases.

Alternatively, if you want to query a single gene or list of genes against these categories you can use the 'Search Categories' function of DGIdb and enter your gene(s) and filter against which sources you trust as a definition of kinases.

Finally, you can use the DGIdb API's Genes in Category endpoint to get at this information programmatically.

Kinases in DGIdb are defined in various ways by different sources (including the Gene Ontology). A recent "reliable" list would be the dGene list which is based on review of the literature. See the sources and their corresponding publications for more details of how they defined kinases.

Ram · Answer 4 · 2014-08-03

Slightly surprised no one already suggested http://www.ebi.ac.uk/interpro/interproscan.html (Pfam HMM matching is part of it). UniProt and Ensembl run this by default. As noted here for EC number assignment tools GO may happily assign function to "dead" kinases so look carefuly at the catalytic site residues (see Regarding Pseudokinases). Then, as Josh says, get someone to do the enzymology experiments (kinases are popular http://cdsouthan.blogspot.se/2013/11/drug-target-time-tracking.html so your chances are good)

score 1 · Answer 5 · 2014-08-03

1

Entering edit mode

9.7 years ago

sgruenwald ▴ 10

Every kinase has to have a DFG motif. You can quickly screen for that. If it has, you can go more deeper into domain analysis.

ADD COMMENT • link 9.7 years ago by sgruenwald ▴ 10

score 0 · Answer 6 · 2013-08-27

0

Entering edit mode

10.6 years ago

sanchezcavani ▴ 220

Damain annotation might be the best way I think. Just have a look at whether a kinase domain exists there.

ADD COMMENT • link 10.6 years ago by sanchezcavani ▴ 220