Home
This Title All WIREs
WIREs RSS Feed
How to cite this WIREs title:
WIREs Comput Mol Sci

Optimization of protein models

Full article on Wiley Online Library:   HTML PDF

Can't access this content? Tell your librarian.

Protein structure predictions, and experimentally derived protein structures, very often require certain structure improvement (refinement), which means bringing it closer to real, usually in vivo working conformations. In respect to the variety of protein models to be refined, computational optimization procedures could be divided into localized (applied to a small part of a structure) and global (whole structure). Generally speaking, the first problem is usually tractable, and the latter remains to be extremely challenging for systems larger then peptides or small proteins: optimization complexity and difficulty dramatically increase with the size of structures to be optimized. © 2012 John Wiley & Sons, Ltd.

Figure 1.

Stages of integrative structure determination and analogous structure prediction. Structure determination by integrating varied data from experiments and modeling can be divided into four steps12: (1) generation of structural data by experiment, (2) data translation into spatial restraints, (3) optimization, and (4) ensemble analysis. These four steps are also characteristic for structure prediction, shown here for easy modeling cases (unambiguous data derived from homologous proteins) and difficult ones (ambiguous or sparse data) embedding fold recognition methods, together with de novo modeling.

[ Normal View 72K | Magnified View 94K ]
Figure 2.

Approximate dependence of sequence alignment accuracy and expected models accuracy on the percentage of sequence identity. The classification of model accuracy (see Table 1) practically agrees with alignment accuracy, which falls into one of the following three zones of sequence similarity (defined by Rost15): safe, twilight, and midnight zone. The twilight zone denotes a huge drop in alignment accuracy (roughly in the range of 25%–30% of sequence identity) from the safe zone (high level of sequence similarity—proteins also have similar structures and functions) to the midnight zone (low level of sequence similarity—protein similarity cannot be detected from sequence comparisons alone).

[ Normal View 9K | Magnified View 22K ]
Figure 3.

Results of TR592 refinement from the CASP9 refinement competition. Top: Three superimposed structures are shown—native in blue, starting model in gray [coordinate root mean square deviation (crmsd): 1.26], and the best model in magenta (crmsd: 0.96). Significant improvement from the starting model can be observed in the loop region marked with a circle. It is noteworthy that this region was pointed out by the competition assessors to the participating groups as one of the main areas for refinement. Bottom: In the same size and orientation as mentioned above; all the predicted models (designated by participating groups as the best) are shown in magenta, together with the native in blue (a few heavily mismatched models were removed for clarity).

[ Normal View 75K | Magnified View 97K ]
Figure 4.

Distribution of coordinate root mean square deviation (crmsd) of the starting structures for the refinement in CASP8 and CASP9. The majority of models are close to the native structure.

[ Normal View 5K | Magnified View 14K ]
Figure 5.

Distribution of coordinate root mean square deviation (crmsd) differences between structures before and after refinement, obtained during the CASP9 experiment. Green color (negative Δcrmsd) means improvements and red (positive Δcrmsd) means worsening of the original models.

[ Normal View 46K | Magnified View 53K ]
Figure 6.

Distribution of Δcrmsd (coordinate root mean square deviation) for each method. Successful results are below the red dashed line. Only for eight of 34 methods have the sets of the resulting structures mean Δcrmsd lower than zero.

[ Normal View 48K | Magnified View 56K ]

Browse by Topic

Simulation Methods > Molecular Dynamics and Monte-Carlo Methods
blog comments powered by Disqus

Access to this WIREs title is by subscription only.

Recommend to Your
Librarian Now!

The latest WIREs articles in your inbox

Sign Up for Article Alerts