Ab initio fold prediction of small helical proteins using distance geometry and knowledge-based scoring functions

被引:73
作者
Huang, ES
Samudrala, R
Ponder, JW [1 ]
机构
[1] Washington Univ, Sch Med, Dept Biochem & Mol Biophys, St Louis, MO 63110 USA
[2] Stanford Univ, Dept Biol Struct, Sch Med, Stanford, CA 94305 USA
关键词
protein structure prediction; distance geometry; helix packing;
D O I
10.1006/jmbi.1999.2861
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The problem of protein tertiary structure prediction from primary sequence can be separated into two subproblems: generation of a library of possible folds;md specification of a best fold given the library. A distance geometry procedure based on random pairwise metrization with good sampling properties was used to generate a library of 500 possible structures for each of 11 small helical proteins. The input to distance geometry consisted of sets of restraints to enforce predicted helical secondary structure and a generic range of 5 to 11 Angstrom between predicted contact residues on all pairs of helices. For each of the 11. targets, the resulting library contained structures with low RMSD versus the native structure. Near-native sampling was enhanced by at least three orders of magnitude compared to a random sampling of compact folds. All library members were scored with a combination of an all-atom distance-dependent function, a residue pair-potential, and a hydrophobicity function. In six of the 11 cases, the best-ranking fold was considered to be near native. Each library was also reduced to a final ab initio prediction via consensus distance geometry performed over the 50 best-ranking structures from the full set of 500. The consensus results were of generally higher quality, yielding six predictions within 6.5 Angstrom of the native fold. These favorable predictions corresponded to those for which the correlation between the RMSD and the scoring function were highest. The advantage of the reported methodology is its extreme simplicity and potential for including other types of structural restraints. (C) 1999 Academic Press.
引用
收藏
页码:267 / 281
页数:15
相关论文
共 42 条
[1]   GLOBAL FOLD DETERMINATION FROM A SMALL NUMBER OF DISTANCE RESTRAINTS [J].
ASZODI, A ;
GRADWELL, MJ ;
TAYLOR, WR .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 251 (02) :308-326
[2]   AN EVOLUTIONARY APPROACH TO FOLDING SMALL ALPHA-HELICAL PROTEINS THAT USES SEQUENCE INFORMATION AND AN EMPIRICAL GUIDING FITNESS FUNCTION [J].
BOWIE, JU ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (10) :4436-4440
[3]   A combinatorial distance-constraint approach to predicting protein tertiary models from known secondary structure [J].
Chelvanayagam, G ;
Knecht, L ;
Jenny, T ;
Benner, SA ;
Gonnet, GH .
FOLDING & DESIGN, 1998, 3 (03) :149-160
[4]   Using imperfect secondary structure predictions to improve molecular structure computations [J].
Chen, CC ;
Singh, JP ;
Altman, RB .
BIOINFORMATICS, 1999, 15 (01) :53-65
[5]   PROTEIN FOLDING - EVALUATION OF SOME SIMPLE RULES FOR THE ASSEMBLY OF HELICES INTO TERTIARY STRUCTURES WITH MYOGLOBIN AS AN EXAMPLE [J].
COHEN, FE ;
RICHMOND, TJ ;
RICHARDS, FM .
JOURNAL OF MOLECULAR BIOLOGY, 1979, 132 (03) :275-288
[6]   FOLDING PROTEIN ALPHA-CARBON CHAINS INTO COMPACT FORMS BY MONTE-CARLO METHODS [J].
COVELL, DG .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1992, 14 (03) :409-420
[7]  
Cui Y, 1998, PROTEINS, V31, P247, DOI 10.1002/(SICI)1097-0134(19980515)31:3<247::AID-PROT2>3.0.CO
[8]  
2-G
[9]   Identifying the tertiary fold of small proteins with different topologies from sequence and secondary structure using the genetic algorithm and extended criteria specific for strand regions [J].
Dandekar, T ;
Argos, P .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 256 (03) :645-660
[10]   THE MIDAS DISPLAY SYSTEM [J].
FERRIN, TE ;
HUANG, CC ;
JARVIS, LE ;
LANGRIDGE, R .
JOURNAL OF MOLECULAR GRAPHICS, 1988, 6 (01) :13-&