TOUCHSTONE II: A new approach to ab initio protein structure prediction

被引：200

作者：

Zhang, Y

Kolinski, A

Skolnick, J ^{[1
]}

机构：

[1] Univ Buffalo, Ctr Excellence Bioinformat, Buffalo, NY 14203 USA

[2] Warsaw Univ, Fac Chem, PL-02093 Warsaw, Poland

来源：

BIOPHYSICAL JOURNAL | 2003年 / 85卷 / 02期

关键词：

D O I：

10.1016/S0006-3495(03)74551-2

中图分类号：

Q6 [生物物理学];

学科分类号：

071011 ;

摘要：

We have developed a new combined approach for ab initio protein structure prediction. The protein conformation is described as a lattice chain connecting C-alpha atoms, with attached C-beta atoms and side-chain centers of mass. The model force field includes various short-range and long-range knowledge-based potentials derived from a statistical analysis of the regularities of protein structures. The combination of these energy terms is optimized through the maximization of correlation for 30 x 60,000 decoys between the root mean square deviation (RMSD) to native and energies, as well as the energy gap between native and the decoy ensemble. To accelerate the conformational search, a newly developed parallel hyperbolic sampling algorithm with a composite movement set is used in the Monte Carlo simulation processes. We exploit this strategy to successfully fold 41/100 small proteins (36 similar to 120 residues) with predicted structures having a RMSD from native below 6.5 Angstrom in the top five cluster centroids. To fold larger-size proteins as well as to improve the folding yield of small proteins, we incorporate into the basic force field side-chain contact predictions from our threading program PROSPECTOR where homologous proteins were excluded from the data base. With these threading-based restraints, the program can fold 83/125 test proteins (36 similar to 174 residues) with structures having a RMSD to native below 6.5 Angstrom in the top five cluster centroids. This shows the significant improvement of folding by using predicted tertiary restraints, especially when the accuracy of side-chain contact prediction is >20%. For native fold selection, we introduce quantities dependent on the cluster density and the combination of energy and free energy, which show a higher discriminative power to select the native structure than the previously used cluster energy or cluster size, and which can be used in native structure identification in blind simulations. These procedures are readily automated and are being implemented on a genomic scale.

引用

页码：1145 / 1164

页数：20

共 41 条

[1] PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].