Computational prediction of native protein ligand-binding and enzyme active site sequences

被引:34
作者
Chakrabarti, R [1 ]
Klibanov, AM
Friesner, RA
机构
[1] Columbia Univ, Dept Chem, New York, NY 10027 USA
[2] Columbia Univ, Ctr Biomol Simulat, New York, NY 10027 USA
[3] MIT, Dept Chem, Cambridge, MA 02139 USA
[4] MIT, Div Biol Engn, Cambridge, MA 02139 USA
关键词
computational protein design; design enzyme; protein evolution;
D O I
10.1073/pnas.0504023102
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent studies reveal that the core sequences of many proteins were nearly optimized for stability by natural evolution. Surface residues, by contrast, are not so optimized, presumably because protein function is mediated through surface interactions with other molecules. Here, we sought to determine the extent to which the sequences of protein ligand-binding and enzyme active sites could be predicted by optimization of scoring functions based on protein ligand-binding affinity rather than structural stability. Optimization of binding affinity under constraints on the folding free energy correctly predicted 83% of amino acid residues (94% similar) in the binding sites of two model receptor-ligand complexes, streptavidin-biotin and glucose-binding protein. To explore the applicability of this methodology to enzymes, we applied an identical algorithm to the active sites of diverse enzymes from the peptidase, beta-gal, and nucleotide synthase families. Although simple optimization of binding affinity reproduced the sequences of some enzyme active sites with high precision, imposition of additional, geometric constraints on side-chain conformations based on the catalytic mechanism was required in other cases. With these modifications, our sequence optimization algorithm correctly predicted 78% of residues from all of the enzymes, with 83% similar to native (90% correct, with 95% similar, excluding residues with high variability in multiple sequence alignments). Furthermore, the conformations of the selected side chains were often correctly predicted within crystallographic error. These findings suggest that simple selection pressures may have played a predominant role in determining the sequences of ligand-binding and active sites in proteins.
引用
收藏
页码:10153 / 10158
页数:6
相关论文
共 22 条
[1]   Computer search algorithms in protein modification and design [J].
Desjarlais, JR ;
Clarke, ND .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1998, 8 (04) :471-475
[2]   THE DEAD-END ELIMINATION THEOREM AND ITS USE IN PROTEIN SIDE-CHAIN POSITIONING [J].
DESMET, J ;
DEMAEYER, M ;
HAZES, B ;
LASTERS, I .
NATURE, 1992, 356 (6369) :539-542
[3]   Glide: A new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy [J].
Friesner, RA ;
Banks, JL ;
Murphy, RB ;
Halgren, TA ;
Klicic, JJ ;
Mainz, DT ;
Repasky, MP ;
Knoll, EH ;
Shelley, M ;
Perry, JK ;
Shaw, DE ;
Francis, P ;
Shenkin, PS .
JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (07) :1739-1749
[4]   The SGB/NP hydration free energy model based on the surface generalized born solvent reaction field and novel nonpolar hydration free energy estimators [J].
Gallicchio, E ;
Zhang, LY ;
Levy, RM .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2002, 23 (05) :517-529
[5]   Generalized born model based on a surface integral formulation [J].
Ghosh, A ;
Rapp, CS ;
Friesner, RA .
JOURNAL OF PHYSICAL CHEMISTRY B, 1998, 102 (52) :10983-10990
[6]   Energy functions for protein design [J].
Gordon, DB ;
Marshall, SA ;
Mayo, SL .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1999, 9 (04) :509-513
[7]   ASN(177) IN ESCHERICHIA-COLI THYMIDYLATE SYNTHASE IS A MAJOR DETERMINANT OF PYRIMIDINE SPECIFICITY [J].
HARDY, LW ;
NALIVAIKA, E .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (20) :9725-9729
[8]   A hierarchical approach to all-atom protein loop prediction [J].
Jacobson, MP ;
Pincus, DL ;
Rapp, CS ;
Day, TJF ;
Honig, B ;
Shaw, DE ;
Friesner, RA .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 55 (02) :351-367
[9]   Force field validation using protein side chain prediction [J].
Jacobson, MP ;
Kaminski, GA ;
Friesner, RA ;
Rapp, CS .
JOURNAL OF PHYSICAL CHEMISTRY B, 2002, 106 (44) :11673-11680
[10]   Folding free energy function selects native-like protein sequences in the core but not on the surface [J].
Jaramillo, A ;
Wernisch, L ;
Héry, S ;
Wodak, SJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (21) :13554-13559