Fast tree search for enumeration of a lattice model of protein folding

被引:22
作者
Cejtin, H
Edler, J
Gottlieb, A
Helling, R
Li, H
Philbin, J
Wingreen, N
Tang, C
机构
[1] NEC Res Inst, Princeton, NJ 08540 USA
[2] NYU, Dept Comp Sci, New York, NY 10003 USA
关键词
D O I
10.1063/1.1423324
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Using a fast tree-searching algorithm and a Pentium cluster, we enumerated all the sequences and compact conformations (structures) for a protein folding model on a cubic lattice of size 4x3x3. We used two types of amino acids-hydrophobic (H) and polar (P)-to make up the sequences, so there were 2(36)approximate to6.87x10(10) different sequences. The total number of distinct structures was 84 731 192. We made use of a simple solvation model in which the energy of a sequence folded into a structure is minus the number of hydrophobic amino acids in the "core" of the structure. For every sequence, we found its ground state or ground states, i.e., the structure or structures for which its energy is lowest. About 0.3% of the sequences have a unique ground state. The number of structures that are unique ground states of at least one sequence is 2 662 050, about 3% of the total number of structures. However, these "designable" structures differ drastically in their designability, defined as the number of sequences whose unique ground state is that structure. To understand this variation in designability, we studied the distribution of structures in a high dimensional space in which each structure is represented by a string of 1's and 0's, denoting core and surface sites, respectively. (C) 2002 American Institute of Physics.
引用
收藏
页码:352 / 359
页数:8
相关论文
共 37 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]   MINIMUM ENERGY COMPACT STRUCTURES OF RANDOM SEQUENCES OF HETEROPOLYMERS [J].
CAMACHO, CJ ;
THIRUMALAI, D .
PHYSICAL REVIEW LETTERS, 1993, 71 (15) :2505-2508
[3]   ENERGY LANDSCAPES AND THE COLLAPSE DYNAMICS OF HOMOPOLYMERS [J].
CHAN, HS ;
DILL, KA .
JOURNAL OF CHEMICAL PHYSICS, 1993, 99 (03) :2116-2127
[4]   TRANSITION-STATES AND FOLDING DYNAMICS OF PROTEINS AND HETEROPOLYMERS [J].
CHAN, HS ;
DILL, KA .
JOURNAL OF CHEMICAL PHYSICS, 1994, 100 (12) :9238-9257
[5]  
CHAN HS, 1991, ANNU REV BIOPHYS BIO, V20, P447, DOI 10.1146/annurev.bb.20.060191.002311
[6]  
Creighton T. E., 1992, PROTEIN FOLDING
[7]   DOMINANT FORCES IN PROTEIN FOLDING [J].
DILL, KA .
BIOCHEMISTRY, 1990, 29 (31) :7133-7155
[8]   THEORY FOR THE FOLDING AND STABILITY OF GLOBULAR-PROTEINS [J].
DILL, KA .
BIOCHEMISTRY, 1985, 24 (06) :1501-1509
[9]   SOLVATION ENERGY IN PROTEIN FOLDING AND BINDING [J].
EISENBERG, D ;
MCLACHLAN, AD .
NATURE, 1986, 319 (6050) :199-203
[10]  
EMBERLY EG, IN PRESS PROTEINS