DATABASE ALGORITHM FOR GENERATING PROTEIN BACKBONE AND SIDE-CHAIN COORDINATES FROM A C-ALPHA TRACE APPLICATION TO MODEL-BUILDING AND DETECTION OF COORDINATE ERRORS

被引:290
作者
HOLM, L
SANDER, C
机构
[1] EMBL, D-6900 Heidelberg
基金
芬兰科学院;
关键词
D O I
10.1016/0022-2836(91)90883-8
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The problem of constructing all-atom model co-ordinates of a protein from an outline of the polypeptide chain is encountered in protein structure determination by crystallography or nuclear magnetic resonance spectroscopy, in model building by homology and in protein design. Here, we present an automatic procedure for generating full protein co-ordinates (backbone and, optionally, side-chains) given the Cα trace and amino acid sequence. To construct backbones, a protein structure database is first scanned for fragments that locally fit the chain trace according to distance criteria. A best path algorithm then sifts through these segments and selects an optimal path with minimal mismatch at fragment joints. In blind tests, using fully known protein structures, backbones (Cα, C, N, O) can be reconstructed with a reliability of 0.4 to 0.6 Å root-mean-square position deviation and not more than 0 to 5% peptide flips. This accuracy is sufficient to identify possible errors in protein co-ordinate sets. To construct full co-ordinates, side-chains are added from a library of frequently occurring rotamers using a simple and fast Monte Carlo procedure with simulated annealing. In tests on X-ray structures determined at better than 2.5 Å resolution, the positions of side-chain atoms in the protein core (less than 20% relative accessibility) have an accuracy of 1.6 Å (r.m.s. deviation) and 70% of χ1 angles are within 30 ° of the X-ray structure. The computer program MaxSprout is available on request. © 1991.
引用
收藏
页码:183 / 194
页数:12
相关论文
共 31 条
[11]   ATOMIC-STRUCTURE OF THE ACTIN - DNASE-I COMPLEX [J].
KABSCH, W ;
MANNHERZ, HG ;
SUCK, D ;
PAI, EF ;
HOLMES, KC .
NATURE, 1990, 347 (6288) :37-44
[12]   DISCUSSION OF SOLUTION FOR BEST ROTATION TO RELATE 2 SETS OF VECTORS [J].
KABSCH, W .
ACTA CRYSTALLOGRAPHICA SECTION A, 1978, 34 (SEP) :827-828
[13]   DICTIONARY OF PROTEIN SECONDARY STRUCTURE - PATTERN-RECOGNITION OF HYDROGEN-BONDED AND GEOMETRICAL FEATURES [J].
KABSCH, W ;
SANDER, C .
BIOPOLYMERS, 1983, 22 (12) :2577-2637
[14]   STRUCTURE OF UNLIGATED ASPARTATE CARBAMOYLTRANSFERASE OF ESCHERICHIA-COLI AT 2.6-A RESOLUTION [J].
KE, HM ;
HONZATKO, RB ;
LIPSCOMB, WN .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1984, 81 (13) :4037-4040
[15]   STRUCTURAL ASYMMETRY IN THE CTP-LIGANDED FORM OF ASPARTATE CARBAMOYLTRANSFERASE FROM ESCHERICHIA-COLI [J].
KIM, KH ;
PAN, ZX ;
HONZATKO, RB ;
KE, HM ;
LIPSCOMB, WN .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 196 (04) :853-875
[16]   OPTIMIZATION BY SIMULATED ANNEALING [J].
KIRKPATRICK, S ;
GELATT, CD ;
VECCHI, MP .
SCIENCE, 1983, 220 (4598) :671-680
[17]   EQUATION OF STATE CALCULATIONS BY FAST COMPUTING MACHINES [J].
METROPOLIS, N ;
ROSENBLUTH, AW ;
ROSENBLUTH, MN ;
TELLER, AH ;
TELLER, E .
JOURNAL OF CHEMICAL PHYSICS, 1953, 21 (06) :1087-1092
[18]   A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS [J].
NEEDLEMAN, SB ;
WUNSCH, CD .
JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) :443-+
[19]  
PIERROT M, 1982, J BIOL CHEM, V257, P4341
[20]   TERTIARY TEMPLATES FOR PROTEINS - USE OF PACKING CRITERIA IN THE ENUMERATION OF ALLOWED SEQUENCES FOR DIFFERENT STRUCTURAL CLASSES [J].
PONDER, JW ;
RICHARDS, FM .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (04) :775-791