Neutral networks in protein space: a computational study based on knowledge-based potentials of mean force

被引:80
作者
Babajide, A
Hofacker, IL
Sippl, MJ
Stadler, PF
机构
[1] UNIV VIENNA,INST THEORET CHEM,A-1090 VIENNA,AUSTRIA
[2] UNIV ILLINOIS,BECKMAN INST,URBANA,IL 61801
[3] SALZBURG UNIV,CAME,A-5020 SALZBURG,AUSTRIA
[4] SANTA FE INST,SANTA FE,NM 87501
来源
FOLDING & DESIGN | 1997年 / 2卷 / 05期
基金
奥地利科学基金会;
关键词
inverse folding; knowledge-based potentials; neutral mutations; protein design; protein evolution; protein folding;
D O I
10.1016/S1359-0278(97)00037-0
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: Many protein sequences, often unrelated, adopt similar folds. Sequences folding into the same shape thus form subsets of sequence space. The shape and the connectivity of these sets have implications for protein evolution and de novo design. Results: We investigate the topology of these sets for some proteins with known three-dimensional structure using inverse folding techniques. First, we find that sequences adopting a given fold do not cluster in sequence space and that there is no detectable sequence homology among them. Nevertheless, these sequences are connected in the sense that there exists a path such that every sequence can be reached from every other sequence while the fold remains unchanged. We find similar results for restricted amino acid alphabets in some cases (e.g. ADLG). In other cases, it seems impossible to find sequences with native-like behavior (e.g. QLR). These findings seem to be independent of the particular structure considered. Conclusions: Amino acid sequences folding into a common shape are distributed homogeneously in sequence space. Hence, the connectivity of the set of these sequences implies the existence of very long neutral paths on all examined protein structures. Regarding protein design, these results imply that sequences with more or less arbitrary chemical properties can be attached to a given structural framework. But we also observe that designability varies significantly among native structures. These features of protein sequence space are similar to what has been found for nucleic acids.
引用
收藏
页码:261 / 269
页数:9
相关论文
共 53 条
[1]   A CANONICAL DECOMPOSITION-THEORY FOR METRICS ON A FINITE-SET [J].
BANDELT, HJ ;
DRESS, AWM .
ADVANCES IN MATHEMATICS, 1992, 92 (01) :47-105
[2]   AN IMPROVED PAIR POTENTIAL TO RECOGNIZE NATIVE PROTEIN FOLDS [J].
BAUER, A ;
BEYER, A .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 18 (03) :254-261
[3]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[4]   IDENTIFICATION OF PROTEIN FOLDS - MATCHING HYDROPHOBICITY PATTERNS OF SEQUENCE SETS WITH SOLVENT ACCESSIBILITY PATTERNS OF KNOWN STRUCTURES [J].
BOWIE, JU ;
CLARKE, ND ;
PABO, CO ;
SAUER, RT .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1990, 7 (03) :257-264
[5]   STRUCTURE-DERIVED HYDROPHOBIC POTENTIAL - HYDROPHOBIC POTENTIAL DERIVED FROM X-RAY STRUCTURES OF GLOBULAR-PROTEINS IS ABLE TO IDENTIFY NATIVE FOLDS [J].
CASARI, G ;
SIPPL, MJ .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 224 (03) :725-732
[6]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544
[7]   FOLDED PROTEINS OCCUR FREQUENTLY IN LIBRARIES OF RANDOM AMINO-ACID-SEQUENCES [J].
DAVIDSON, AR ;
SAUER, RT .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (06) :2146-2150
[8]  
DILL KA, 1995, PROTEIN SCI, V4, P561
[9]   MOLECULAR ENGINEERING - AN APPROACH TO THE DEVELOPMENT OF GENERAL CAPABILITIES FOR MOLECULAR MANIPULATION [J].
DREXLER, KE .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-PHYSICAL SCIENCES, 1981, 78 (09) :5275-5278
[10]   WHY DO PROTEIN ARCHITECTURES HAVE BOLTZMANN-LIKE STATISTICS [J].
FINKELSTEIN, AV ;
BADRETDINOV, AY ;
GUTIN, AM .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (02) :142-150