Question

Proteins Suitable For A Structure Prediction Project ?

2

Entering edit mode

10.5 years ago

curiousgeorge ▴ 20

Hello there !

I am pursuing a project where I intend to (required to) predict the structure of a protein using homology modelling. I have searched the internet a lot but couldn't find a suitable candidate. Can anybody already having experience in the area shed some light on the proteins which will be suitable for this purpose, like for example whose structures are not known and are under 400 residues?

Regards,

homology-modeling • 2.5k views

ADD COMMENT • link updated 14 months ago by Ram 43k • written 10.5 years ago by curiousgeorge ▴ 20

score 3 · Answer 1 · 2013-10-10

3

Entering edit mode

10.5 years ago

Neilfws 49k

The median length of proteins from most organisms is < 400 and of the millions of proteins sequenced, some tens of thousands have structures. So in fact most proteins fit your example criteria.

I would work a little harder on a selection strategy.

ADD COMMENT • link 10.5 years ago by Neilfws 49k

0

Entering edit mode

Thanks, it soothes me to know that there are tons and tons of candidates out there !

ADD REPLY • link 10.5 years ago by curiousgeorge ▴ 20

score 1 · Answer 2 · 2013-10-11

I guess you did BLAST your sequence (for instance in NCBI), and check if any hit was linked to the PDB database, right? That would be one way to find a candidate. A less efficient way to do a similar thing would be to type in the name of your protein and see if it can be found at the PDB... However, when you said you have searched the internet a lot, it likely already included these possibilities.

Alternatively, some modelling servers actually do find out templates for you -if available-. I believe I-tasser is somewhere in-between homology and ab-initio modelling: http://zhanglab.ccmb.med.umich.edu/I-TASSER/about.html ...and starts by looking for templates. It will be slow, though.

Also, the Robetta.org server, which performs Ab-initio modelling seems to look for templates in its initial step, the "Ginzu", by which functional domains are tried to be assigned. It will take you some days to have a result, however; sometimes weeks if you request a full model or the server load is too high.

Hope this helps. Cheers,

K

score 1 · Answer 3 · 2013-10-11

Adding to Neilfws' answer:

I would look at the Protein Structure Initiative websites and their targets. Another interesting idea is to look for all structures deposited in the last half-year in the RCSB Protein DataBank and see which one pleases you from a biological point of view, and then try to extrapolate it for another organism. For example, you find hen lysozime and you try and model human lysozime. Don't try to find an extremely tough target. From what I understand, you are only required to model something, probably to learn the whole process and how it works, so the final result is irrelevant. Choose something appealing and easy to present, but that also has a few challenges (identity < 60% and a few gaps for example).

Crazy suggestions: E2/E3 proteins from the ubiquitination pathway.