TOUCHSTONE II: A new approach to ab initio protein structure prediction

被引:200
作者
Zhang, Y
Kolinski, A
Skolnick, J [1 ]
机构
[1] Univ Buffalo, Ctr Excellence Bioinformat, Buffalo, NY 14203 USA
[2] Warsaw Univ, Fac Chem, PL-02093 Warsaw, Poland
关键词
D O I
10.1016/S0006-3495(03)74551-2
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
We have developed a new combined approach for ab initio protein structure prediction. The protein conformation is described as a lattice chain connecting C-alpha atoms, with attached C-beta atoms and side-chain centers of mass. The model force field includes various short-range and long-range knowledge-based potentials derived from a statistical analysis of the regularities of protein structures. The combination of these energy terms is optimized through the maximization of correlation for 30 x 60,000 decoys between the root mean square deviation (RMSD) to native and energies, as well as the energy gap between native and the decoy ensemble. To accelerate the conformational search, a newly developed parallel hyperbolic sampling algorithm with a composite movement set is used in the Monte Carlo simulation processes. We exploit this strategy to successfully fold 41/100 small proteins (36 similar to 120 residues) with predicted structures having a RMSD from native below 6.5 Angstrom in the top five cluster centroids. To fold larger-size proteins as well as to improve the folding yield of small proteins, we incorporate into the basic force field side-chain contact predictions from our threading program PROSPECTOR where homologous proteins were excluded from the data base. With these threading-based restraints, the program can fold 83/125 test proteins (36 similar to 174 residues) with structures having a RMSD to native below 6.5 Angstrom in the top five cluster centroids. This shows the significant improvement of folding by using predicted tertiary restraints, especially when the accuracy of side-chain contact prediction is >20%. For native fold selection, we introduce quantities dependent on the cluster density and the combination of energy and free energy, which show a higher discriminative power to select the native structure than the previously used cluster energy or cluster size, and which can be used in native structure identification in blind simulations. These procedures are readily automated and are being implemented on a genomic scale.
引用
收藏
页码:1145 / 1164
页数:20
相关论文
共 41 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]  
[Anonymous], 2012, Introduction to protein structure
[3]   A surprising simplicity to protein folding [J].
Baker, D .
NATURE, 2000, 405 (6782) :39-42
[4]   PATTERNS OF DIVERGENCE IN HOMOLOGOUS PROTEINS AS INDICATORS OF SECONDARY AND TERTIARY STRUCTURE - A PREDICTION OF THE STRUCTURE OF THE CATALYTIC DOMAIN OF PROTEIN-KINASES [J].
BENNER, SA ;
GERLOFF, D .
ADVANCES IN ENZYME REGULATION, 1991, 31 :121-181
[5]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[6]  
Betancourt MR, 2001, J COMPUT CHEM, V22, P339, DOI 10.1002/1096-987X(200102)22:3<339::AID-JCC1006>3.0.CO
[7]  
2-R
[8]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[9]   SWISS-MODEL and the Swiss-PdbViewer: An environment for comparative protein modeling [J].
Guex, N ;
Peitsch, MC .
ELECTROPHORESIS, 1997, 18 (15) :2714-2723
[10]   PREDICTION OF PROTEIN ANTIGENIC DETERMINANTS FROM AMINO-ACID-SEQUENCES [J].
HOPP, TP ;
WOODS, KR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1981, 78 (06) :3824-3828