Identifying the tertiary fold of small proteins with different topologies from sequence and secondary structure using the genetic algorithm and extended criteria specific for strand regions

被引:62
作者
Dandekar, T [1 ]
Argos, P [1 ]
机构
[1] EUROPEAN MOLEC BIOL LAB, D-69012 HEIDELBERG, GERMANY
关键词
genetic algorithm; protein folding; protein structure; beta-strands; tertiary fold prediction;
D O I
10.1006/jmbi.1996.0115
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Grid-free protein folding simulations based on sequence and secondary structure knowledge (using mostly experimentally determined secondary structure information but also analysing results from secondary structure predictions) were investigated using the genetic algorithm, a backbone representation, and standard dihedral angular conformations. Optimal structures are selected according to basic protein building principles. Having previously applied this approach to proteins with helical topology, we have now developed additional criteria and weights for beta-strand-containing proteins, validated them on four small beta-strand-rich proteins with different topologies, and tested the general performance of the method on many further examples from known protein structures with mixed secondary structural type and less than 100 amino acid residues. Topology predictions close to the observed experimental structures were obtained in four test cases together with fitness values that correlated with the similarity of the predicted topology to the observed structures. Root-mean-square deviation values of C-alpha atoms in the superposed predicted and observed structures, the latter of which had different topologies, were between 4.5 and 5.5 Angstrom (2.9 to 5.1 Angstrom without loops). Including 15 further protein examples with unique folds, root-mean-square deviation values ranged between 1.8 and 6.9 Angstrom with loop regions and averaged 5.3 Angstrom and 4.3 Angstrom, including and excluding loop regions, respectively. (C) 1996 Academic Press Limited
引用
收藏
页码:645 / 660
页数:16
相关论文
共 39 条
[1]   ASSESSMENT OF PROTEIN SECONDARY STRUCTURE PREDICTION METHODS BASED ON AMINO-ACID SEQUENCE [J].
ARGOS, P ;
SCHWARZ, J ;
SCHWARZ, J .
BIOCHIMICA ET BIOPHYSICA ACTA, 1976, 439 (02) :261-273
[2]   THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK, RECENT DEVELOPMENTS [J].
BAIROCH, A ;
BOECKMANN, B .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :3093-3096
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[4]   AN EVOLUTIONARY APPROACH TO FOLDING SMALL ALPHA-HELICAL PROTEINS THAT USES SEQUENCE INFORMATION AND AN EMPIRICAL GUIDING FITNESS FUNCTION [J].
BOWIE, JU ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (10) :4436-4440
[5]   STEREO VIEWING ON THE PC/AT WITH EGA GRAPHICS [J].
CHELVANAYAGAM, G ;
MCKEAIG, L .
JOURNAL OF MOLECULAR GRAPHICS, 1991, 9 (02) :111-114
[6]   PROTEIN-STRUCTURE - THE 14TH BARREL ROLLS OUT [J].
CHOTHIA, C .
NATURE, 1988, 333 (6174) :598-599
[7]   POTENTIAL OF GENETIC ALGORITHMS IN PROTEIN FOLDING AND PROTEIN ENGINEERING SIMULATIONS [J].
DANDEKAR, T ;
ARGOS, P .
PROTEIN ENGINEERING, 1992, 5 (07) :637-645
[8]   FOLDING THE MAIN-CHAIN OF SMALL PROTEINS WITH THE GENETIC ALGORITHM [J].
DANDEKAR, T ;
ARGOS, P .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 236 (03) :844-861
[9]   UNDERSTANDING HOW PROTEINS FOLD - THE LYSOZYME STORY SO FAR [J].
DOBSON, CM ;
EVANS, PA ;
RADFORD, SE .
TRENDS IN BIOCHEMICAL SCIENCES, 1994, 19 (01) :31-37
[10]   PROTEIN-STRUCTURE PREDICTION - RECOGNITION OF PRIMARY, SECONDARY, AND TERTIARY STRUCTURAL FEATURES FROM AMINO-ACID-SEQUENCE [J].
EISENHABER, F ;
PERSSON, B ;
ARGOS, P .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (01) :1-94