An in silico exploration of the neutral network in protein sequence space

被引:9
作者
Aita, T
Ota, M
Husimi, Y
机构
[1] Saitama Univ, Dept Funct Mat Sci, Urawa, Saitama 3388570, Japan
[2] Novartis Pharma KK, Tsukuba Res Inst, Tsukuba, Ibaraki 3002611, Japan
[3] Natl Inst Genet, Ctr Informat Biol, Mishima, Shizuoka 4118540, Japan
[4] Natl Inst Genet, DDBJ, Mishima, Shizuoka 4118540, Japan
关键词
D O I
10.1006/jtbi.2003.3209
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Designating amino-acid sequences that fold into a common main-chain structure as "neutral sequences" for the structure, regardless of their function or stability, we investigated the distribution of neutral sequences in protein sequence space. For four distinct target structures (alpha, beta, alpha/beta and alpha + beta types) with the same chain length of 108, we generated the respective neutral sequences by using the inverse folding technique with a knowledge-based potential function. We assumed that neutral sequences for a protein structure have Z scores higher than or equal to fixed thresholds, where thresholds are defined as the Z score for the corresponding native sequence (case 1) or much greater Z score (case 2). An exploring walk simulation suggested that the neutral sequences mapped into the sequence space were connected with each other through straight neutral paths and formed an inherent neutral network over the sequence space. Through another exploring walk simulation, we investigated contiguous regions between or among the neutral networks for the distinct protein structures and obtained the following results. The closest approach distance between the two neutral networks ranged from 5 to 29 on the Hamming distance scale, showing a linear increase against the threshold values. The sequences located at the "interchange" regions between the two neutral networks have intermediate sequence-profile-scores for both corresponding structures. Introducing a "ball" in the sequence space that contains at least one neutral sequence for each of the four structures, we found that the minimal radius of the ball that is centered at an arbitrary position ranged from 35 to 50, while the minimal radius of the ball that is centered at a certain special position ranged from 20 to 30, in the Hamming distance scale. The relatively small Hamming distances (5-30) may support an evolution mechanism by transferring from a network for a structure to another network for a more beneficial structure via the interchange regions. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:599 / 613
页数:15
相关论文
共 41 条
[1]  
Agrafiotis DK, 1997, PROTEIN SCI, V6, P287
[2]   Theory of evolutionary molecular engineering through simultaneous accumulation of advantageous mutations [J].
Aita, T ;
Husimi, Y .
JOURNAL OF THEORETICAL BIOLOGY, 2000, 207 (04) :543-556
[3]   Statistical formulae of the energy distribution among a globular protein structure ensemble [J].
Aita, T ;
Husimi, Y .
JOURNAL OF THEORETICAL BIOLOGY, 2003, 220 (01) :107-121
[4]   Surveying a local fitness landscape of a protein with epistatic sites for the study of directed evolution [J].
Aita, T ;
Hamamatsu, N ;
Nomiya, Y ;
Uchiyama, H ;
Shibanaka, Y ;
Husimi, Y .
BIOPOLYMERS, 2002, 64 (02) :95-105
[5]   A cross-section of the fitness landscape of dihydrofolate reductase [J].
Aita, T ;
Iwakura, M ;
Husimi, Y .
PROTEIN ENGINEERING, 2001, 14 (09) :633-638
[6]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[7]   Neutral networks in protein space: a computational study based on knowledge-based potentials of mean force [J].
Babajide, A ;
Hofacker, IL ;
Sippl, MJ ;
Stadler, PF .
FOLDING & DESIGN, 1997, 2 (05) :261-269
[8]   Exploring protein sequence space using knowledge-based potentials [J].
Babajide, A ;
Farber, R ;
Hofacker, IL ;
Inman, J ;
Lapedes, AS ;
Stadler, PF .
JOURNAL OF THEORETICAL BIOLOGY, 2001, 212 (01) :35-46
[9]   Exploring the conformational properties of the sequence space between two proteins with different folds: An experimental study [J].
Blanco, FJ ;
Angrand, I ;
Serrano, L .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 285 (02) :741-753
[10]   Modeling evolutionary landscapes: Mutational stability, topology, and superfunnels in sequence space [J].
Bornberg-Bauer, E ;
Chan, HS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (19) :10689-10694