Fold assembly of small proteins using Monte Carlo simulations driven by restraints derived from multiple sequence alignments

被引:78
作者
Ortiz, AR
Kolinski, A
Skolnick, J
机构
[1] Scripps Res Inst, Dept Mol Biol, La Jolla, CA 92037 USA
[2] Univ Warsaw, Dept Chem, PL-02093 Warsaw, Poland
关键词
protein structure prediction; correlated mutations; side-chain contact prediction; lattice protein models; secondary structure prediction; threading;
D O I
10.1006/jmbi.1997.1595
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The feasibility of predicting the global fold of small proteins by incorporating predicted secondary and tertiary restraints into ab initio folding simulations has been demonstrated on a test set comprised of 20 nonhomologous proteins, of which one was a blind prediction of target 42 in the recent CASP2 contest. These proteins contain from 37 to 100 residues and represent all secondary structural classes and a representative variety of global topologies. Secondary structure restraints are provided by the PHD secondary structure prediction algorithm that incorporates multiple sequence information. Predicted tertiary restraints are derived from multiple sequence alignments via a two-step process. First, seed side-chain contacts are identified from correlated mutation analysis, and then a threading-based algorithm is used to expand the number of these seed contacts. A lattice-based reduced protein model and a folding algorithm designed to incorporate these predicted restraints is described. Depending upon fold complexity, it is possible to assemble native-like topologies whose coordinate root-mean-square deviation from native is between 3.0 Angstrom and 6.5 Angstrom. The requisite level of accuracy in side-chain contact may prediction can be roughly 25% on average, provided that about 60% of the contact predictions are correct within +/-1 residue and 95% of the predictions are correct within +/-4 residues. Precision in tertiary contact prediction is more critical than absolute accuracy. Furthermore, only a subset of the tertiary contacts, on the order of 25% of the total, is sufficient for successful topology assembly. Overall, this study suggests that the use of restraints derived from multiple sequence alignments combined with a fold assembly algorithm holds considerable promise for the prediction of the global topology of small proteins. (C) 1998 Academic Press Limited.
引用
收藏
页码:419 / 448
页数:30
相关论文
共 58 条
[1]   Homology modelling by distance geometry [J].
Aszodi, A ;
Taylor, WR .
FOLDING & DESIGN, 1996, 1 (05) :325-334
[2]   GLOBAL FOLD DETERMINATION FROM A SMALL NUMBER OF DISTANCE RESTRAINTS [J].
ASZODI, A ;
GRADWELL, MJ ;
TAYLOR, WR .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 251 (02) :308-326
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[4]   THE CLASSIFICATION AND ORIGINS OF PROTEIN FOLDING PATTERNS [J].
CHOTHIA, C ;
FINKELSTEIN, AV .
ANNUAL REVIEW OF BIOCHEMISTRY, 1990, 59 :1007-1039
[5]   ON THE PREDICTION OF PROTEIN-STRUCTURE - THE SIGNIFICANCE OF THE ROOT-MEAN-SQUARE DEVIATION [J].
COHEN, FE ;
STERNBERG, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 1980, 138 (02) :321-333
[6]   Identifying the tertiary fold of small proteins with different topologies from sequence and secondary structure using the genetic algorithm and extended criteria specific for strand regions [J].
Dandekar, T ;
Argos, P .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 256 (03) :645-660
[7]   FOLDING THE MAIN-CHAIN OF SMALL PROTEINS WITH THE GENETIC ALGORITHM [J].
DANDEKAR, T ;
ARGOS, P .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 236 (03) :844-861
[8]   Multiple sequence information for threading algorithms [J].
Defay, TR ;
Cohen, FE .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 262 (02) :314-323
[9]   Meeting review: The Second Meeting on the Critical Assessment of Techniques for Protein Structure Prediction (CASP2), Asilomar, California, December 13-16, 1996 [J].
Dunbrack, RL ;
Gerloff, DL ;
Bower, M ;
Chen, XW ;
Lichtarge, O ;
Cohen, FE .
FOLDING & DESIGN, 1997, 2 (02) :R27-R42
[10]   Computational studies of protein folding [J].
Friesner, RA ;
Gunn, JR .
ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 1996, 25 :315-342