Where Can I Get The Secondary Structure Of A Protein?
11
5
11.8 years ago

As in the title... I have a protein and I would like to know its secondary structure. I couldn't find it in UniProt, although I thought they had annotations for it there. In the end I used a predictor (Jpred), but there should be a database for this somewhere.

sequence protein structure • 6.4k views
10
11.8 years ago
Suk211 ★ 1.1k

If you have the PDB file, you can use the standard tool DSSP, which is regarded as the gold standard for assigning secondary structure. If you only have the sequence, I personally prefer PSIPRED, which takes evolutionary information into account to predict the secondary structure. According to the CASP evaluations, it is one of the best secondary structure predictors available.
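DSSP reports a finer-grained alphabet (H, G, I, E, B, T, S and others) than sequence-based predictors such as PSIPRED, which report three states (H, E, C). Comparing the two therefore usually means collapsing the DSSP codes first. Below is a minimal Python sketch of one common reduction. The mapping itself is an assumption (other reductions are in use), and the function name is only illustrative.

```python
# Assumed reduction (one common convention, not the only one):
# helix-like codes H, G, I -> H; strand-like codes E, B -> E;
# every other code (turns, bends, blanks, '-') -> C for coil.
DSSP_TO_3STATE = {"H": "H", "G": "H", "I": "H", "E": "E", "B": "E"}


def reduce_to_three_state(dssp_string: str) -> str:
    """Collapse a per-residue DSSP string into H/E/C."""
    return "".join(DSSP_TO_3STATE.get(code, "C") for code in dssp_string)


if __name__ == "__main__":
    print(reduce_to_three_state("-HHHHGGTT-EEEEB-S"))
```

If a different reduction is preferred (for example, treating G as coil), only the dictionary needs to change.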

6
11.8 years ago
Nicojo ★ 1.1k

I think you found the best answer yourself: use a predictor! There are several out there...

You suggest that there should be a Secondary Structure Database. I'm not sure that makes much sense, let me explain my point of view (which may not be that of everyone): most often, the data that is found in databases is the "state of knowledge" of the described object, based on experimentation.

That may be the case for secondary structures of proteins, but only where the proteins in question have been crystallized. In those cases, not only the secondary structure but also the tertiary structure is known (with the caveat that the crystal structure of a protein does not capture "all" the states that a protein may adopt in real, "dynamic" physiological conditions).

For all those proteins that have not been crystallized, we can only rely on predictions. And I use them quite frequently: they are extremely useful! But as far as I know, no prediction is accepted as fact. They're "educated guesses" that are often correct, but sometimes wrong. The results may differ from one prediction method to another. They also change each time the algorithms are improved...
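To see how much two prediction methods differ for a given protein, their per-residue strings can be compared directly. The Python sketch below assumes both predictions are already in the same three-state alphabet (H, E, C) and cover the same sequence. The function names are illustrative and not part of any predictor's API.

```python
from typing import List, Tuple


def agreement(pred_a: str, pred_b: str) -> float:
    """Fraction of residues on which two per-residue predictions agree."""
    if len(pred_a) != len(pred_b):
        raise ValueError("predictions must cover the same number of residues")
    if not pred_a:
        raise ValueError("predictions must not be empty")
    matches = sum(a == b for a, b in zip(pred_a, pred_b))
    return matches / len(pred_a)


def disagreements(pred_a: str, pred_b: str) -> List[Tuple[int, str, str]]:
    """1-based positions where the two predictions differ, with both states."""
    return [
        (i, a, b)
        for i, (a, b) in enumerate(zip(pred_a, pred_b), start=1)
        if a != b
    ]


if __name__ == "__main__":
    method_1 = "CHHHHCCEEEC"  # made-up example strings
    method_2 = "CHHHCCCEEEE"
    print(agreement(method_1, method_2))
    print(disagreements(method_1, method_2))
```

The same `agreement` function applied to a prediction and a structure-derived assignment gives a rough per-residue accuracy.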

If there were a database of predicted secondary structures, people would likely take them for granted (make the equation prediction = fact), which would be quite "unscientific".

I think such a resource would be more of a hindrance than an asset to the scientific community...

4
11.7 years ago

If it is only one sequence, you may try the PSIPRED server. If you need to work on a large sequence dataset, it is better to install PSIPRED locally. PSIPRED runs are typically computationally intensive.
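For a large dataset, a typical preparation step is to split a multi-sequence FASTA file into one file per sequence, so that each can be submitted to a local predictor as a separate job. The Python sketch below does only that splitting. It assumes the local tool expects one sequence per input file, and the `seq_00001.fasta` naming scheme is hypothetical.

```python
from pathlib import Path
from typing import Iterator, List, Tuple


def read_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def write_single_fastas(text: str, out_dir: str) -> List[Path]:
    """Write each record to its own file and return the paths in input order."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for n, (header, seq) in enumerate(read_fasta(text), start=1):
        path = out / f"seq_{n:05d}.fasta"  # hypothetical naming scheme
        path.write_text(f">{header}\n{seq}\n")
        paths.append(path)
    return paths
```

The returned paths can then be handed to whatever job scheduler or loop runs the predictor, one file at a time.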

3
11.8 years ago

Protein structure prediction is a complex issue that is likely to require multiple approaches. There are many methods/tools listed at the

3
11.8 years ago
Darked89 4.2k

It may be a little dated, but let me blow my own trumpet (a collection of links):

http://openwetware.org/wiki/Wikiomics:Bioinfo_tutorial#Protein_localization_and_structure_prediction

3
11.7 years ago
Nir London ▴ 60

If you do have the protein structure (PDB file), Stride is also a good option for assigning the secondary structure.

2
11.7 years ago
Kirsley ▴ 50

If you want to obtain domains along with their annotations, you can do it locally with RPS-BLAST. Here, for example, is how to obtain Pfam annotations:

rpsblast -i ".$InputPath."/".$item." -d ~/Bioinfo/cdd/Pfam -e 0.000000000001 -o ".\$elemt[0]."_Pfam.rpsblast -T T -m 7

• -i = the input path
• -d = the database path
• -e = the E-value cut-off
• -o = the output name
• -T T and -m 7 = to have the output in XML format

You can download all the databases from CDD, which includes external source databases such as Pfam, SMART, COG, PRK and TIGRFAM.
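The command above is embedded in a script with interpolated variables, so here is a small Python sketch of assembling the same call as an argument list. It uses only the flags quoted in this answer (legacy BLAST syntax). Newer BLAST+ releases use different option names, so check the help output of your own installation. The file names and the function name are hypothetical.

```python
import os
import shlex
from typing import List


def build_rpsblast_args(
    input_fasta: str,
    database: str,
    output_path: str,
    evalue: str = "0.000000000001",  # cut-off used in the command above
) -> List[str]:
    """Build the rpsblast argument list with the flags quoted in the post."""
    return [
        "rpsblast",
        "-i", input_fasta,
        "-d", os.path.expanduser(database),  # '~' is not expanded without a shell
        "-e", evalue,
        "-o", output_path,
        "-T", "T",
        "-m", "7",
    ]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    args = build_rpsblast_args("query.fasta", "~/Bioinfo/cdd/Pfam", "query_Pfam.rpsblast")
    print(shlex.join(args))
    # To run it (requires rpsblast on PATH):
    # import subprocess; subprocess.run(args, check=True)
```

Passing a list rather than a single string avoids quoting problems when paths contain spaces.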

2
11.0 years ago
Thaman ★ 3.3k

You have got all the answers needed for your query. Protein structures are generally determined by X-ray crystallography (which accounts for about 80% of the existing protein structures) or by NMR.

• Protein Data Bank (PDB) - It's the best and most reliable option for finding all the structures available for your protein.

• PyMOL - Visualization and modelling in 3D. You can visualize a structure either by loading a PDB file or by searching through the Load structure plugin.
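Legacy PDB-format files usually also carry the depositors' secondary structure annotation in HELIX and SHEET records, which can be read without any extra tool. The Python sketch below extracts those segments using the fixed column positions of the PDB format. It assumes a legacy `.pdb` file (not mmCIF), ignores insertion codes, and the function name is illustrative.

```python
from typing import List, Tuple

# (kind, chain ID, first residue number, last residue number)
Segment = Tuple[str, str, int, int]


def segments_from_pdb(pdb_text: str) -> List[Segment]:
    """Collect helix and strand segments from HELIX/SHEET records."""
    segments: List[Segment] = []
    for line in pdb_text.splitlines():
        if len(line) < 37:
            continue  # too short to hold the residue-number columns
        record = line[:6]
        if record == "HELIX ":
            # chain: column 20; start: columns 22-25; end: columns 34-37
            segments.append(("helix", line[19], int(line[21:25]), int(line[33:37])))
        elif record == "SHEET ":
            # chain: column 22; start: columns 23-26; end: columns 34-37
            segments.append(("strand", line[21], int(line[22:26]), int(line[33:37])))
    return segments
```

For a downloaded entry, `segments_from_pdb(open("entry.pdb").read())` (with `entry.pdb` standing in for your own file) returns one tuple per annotated helix or strand.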

1
11.0 years ago

If you want to derive the secondary structure from a protein 3D structure, DSSP is one of the best algorithms. Here is the link: http://swift.cmbi.ru.nl/gv/dssp/
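Once DSSP has been run, the per-residue assignment has to be read from its fixed-width output. The Python sketch below assumes the classic DSSP text format, where the residue table starts after a header line beginning with `#  RESIDUE` and the secondary structure code sits in a fixed column. Check these offsets against the output of your DSSP version. The function name is illustrative.

```python
from typing import List, Tuple

# (chain ID, residue number, amino acid, secondary structure code)
Residue = Tuple[str, int, str, str]


def parse_dssp(text: str) -> List[Residue]:
    """Read chain, residue number, amino acid and SS code from DSSP output."""
    residues: List[Residue] = []
    in_table = False
    for line in text.splitlines():
        fields = line.split()
        if len(fields) > 1 and fields[0] == "#" and fields[1] == "RESIDUE":
            in_table = True  # residue rows start on the next line
            continue
        if not in_table or len(line) < 17:
            continue
        if line[13] == "!" or not line[5:10].strip():
            continue  # chain-break marker, no residue on this row
        ss = line[16] if line[16] != " " else "-"  # blank means no assignment
        residues.append((line[11], int(line[5:10]), line[13], ss))
    return residues


def ss_string(residues: List[Residue]) -> str:
    """Join the per-residue codes into a single string."""
    return "".join(ss for _, _, _, ss in residues)
```

The resulting string can be lined up against the sequence or against a predictor's output.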

0
5.5 years ago
102316109 • 0

0
3.8 years ago
Suzanne ▴ 80

Jalview (http://www.jalview.org/) uses the fast and fairly accurate Jpred secondary structure predictor. There is a video about this on YouTube: https://youtu.be/z5cVjR9Q3Mw