De novo protein design. II. Plasticity in sequence space

Cited by: 61

Authors:

Koehl, P [1]

Levitt, M [1]

Affiliations:

[1] Stanford Univ, Dept Biol Struct, Stanford, CA 94305 USA

Source:

JOURNAL OF MOLECULAR BIOLOGY | 1999 / Vol. 293 / Issue 05

Keywords:

protein design; random energy model; sequence space; Monte Carlo; fold recognition;

DOI:

10.1006/jmbi.1999.3212

Chinese Library Classification (CLC) codes:

Q5 [Biochemistry]; Q7 [Molecular Biology];

Subject classification codes:

071010 ; 081704 ;

Abstract:

It is generally accepted that many different protein sequences have similar folded structures, and that there is a relatively high probability that a new sequence possesses a previously observed fold. An indirect consequence of this is that protein design should define the sequence space accessible to a given structure, rather than providing a single optimized sequence. We have recently developed a new approach for protein sequence design, which optimizes the complete sequence of a protein based on the knowledge of its backbone structure, its amino acid composition and a physical energy function including van der Waals interactions, electrostatics, and environment free energy. The specificity of the designed sequence for its template backbone is imposed by keeping the amino acid composition fixed. Here, we show that our procedure converges in sequence space, albeit not to the native sequence of the protein. We observe that while polar residues are well conserved in our designed sequences, non-polar amino acids at the surface of a protein are often replaced by polar residues. The designed sequences provide a multiple alignment of sequences that all adopt the same three-dimensional fold. This alignment is used to derive a profile matrix for chicken triose phosphate isomerase, TIM. The matrix is found to recognize significantly the native sequence for TIM, as well as closely related sequences. Possible application of this approach to protein fold recognition is discussed. (C) 1999 Academic Press.
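As a rough illustration of the profile step mentioned in the abstract, the sketch below builds a position-specific log-odds matrix from a set of equal-length aligned sequences and scores a query against it. It is not the authors' procedure: the uniform background frequency, the pseudocount value, the function names `build_profile` and `score_sequence`, and the toy alignment are all assumptions made for this example.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(alignment, pseudocount=1.0):
    """Build a log-odds profile from ungapped, equal-length aligned sequences.

    Returns one dict per alignment column mapping residue -> log-odds score.
    Assumptions (not from the paper): uniform background, additive pseudocount.
    """
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    background = 1.0 / len(AMINO_ACIDS)
    total = len(alignment) + pseudocount * len(AMINO_ACIDS)
    profile = []
    for col in range(length):
        # Count how often each residue occurs in this column.
        counts = Counter(seq[col] for seq in alignment)
        profile.append({
            aa: math.log(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return profile


def score_sequence(profile, sequence):
    """Sum the per-column log-odds scores of a sequence of matching length."""
    if len(sequence) != len(profile):
        raise ValueError("sequence length must match the profile length")
    return sum(column[aa] for column, aa in zip(profile, sequence))


# Hypothetical toy alignment standing in for a set of designed sequences.
toy_alignment = ["ACDK", "ACEK", "GCDK"]
profile = build_profile(toy_alignment)
```

In this toy setting, a sequence resembling the aligned set (for example `"ACDK"`) scores higher than an unrelated one (for example `"WWWW"`), which is the sense in which a profile can be said to recognize related sequences.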

Pages: 1183-1193

Page count: 11

60 references in total

[1] Agrafiotis DK, 1997, PROTEIN SCI, V6, P287

[2] THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK [J].