Journal of Physical Chemistry B, Vol.108, No.14, 4525-4534, 2004
Design of a protein potential energy landscape by parameter optimization
We propose an automated protocol for designing the energy landscape of a protein energy function by optimizing its parameters. The parameters are optimized so that not only the global minimum-energy conformation becomes nativelike but also the conformations distinct from the native structure have higher energies than those close to the native structure. We classify low-energy conformations into three groups: supernative, nativelike, and non-native. The supernative conformations have all backbone dihedral angles fixed to their native values, and only their side chains are minimized with respect to energy. However, the nativelike and non-native conformations all correspond to the local minima of the energy function. These conformations are ranked according to their root-mean-square deviation (rmsd) of backbone coordinates from the native structure, and a fixed number of conformations with the smallest rmsd values are selected as nativelike conformations, whereas the rest are considered to be non-native conformations. We define two energy gaps E-gap((1)) and E-gap((2)). The energy gap E-gap((1)) (E-gap((2))) is the energy difference between the lowest energy of the non-native conformations and the highest energy of the nativelike (supernative) conformations. The parameters are modified to decrease both E-gap((1)) and E-gap((2)). In addition, the non-native conformations with larger rmsd values are made to have higher energies relative to those with smaller rmsd values. We successfully apply our protocol to the parameter optimization of the UNRES potential energy using the training set of betanova, 1fsd, the 36-residue subdomain of chicken villin headpiece (PDB ID 1vii), and the 10-55 residue fragment of staphylococcal protein A (PDB ID 1bdd). The new protocol of the parameter optimization shows better performance than earlier methods where only the difference between the lowest energies of nativelike and non-native conformations was adjusted without considering various nativelike degrees of the conformations. We also perform jackknife tests on other proteins not included in the training set and obtain promising results. The results suggest that the parameters we obtained using the training set of the four proteins are transferable to other proteins to some extent.