Localization of binding sites in protein structures by optimization of a composite scoring function

被引:22
作者
Rossi, Andrea
Marti-Renom, Marc A.
Sali, Andrej
机构
[1] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, Dept Pharmaceut Chem, San Francisco, CA 94143 USA
关键词
protein function annotation; small ligand binding-site localization;
D O I
10.1110/ps.062247506
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The rise in the number of functionally uncharacterized protein structures is increasing the demand for structure-based methods for functional annotation. Here, we describe a method for predicting the location of a binding site of a given type on a target protein structure. The method begins by constructing a scoring function, followed by a Monte Carlo optimization, to find a good scoring patch on the protein surface. The scoring function is a weighted linear combination of the z-scores of various properties of protein structure and sequence, including amino acid residue conservation, compactness, protrusion, convexity, rigidity, hydrophobicity, and charge density; the weights are calculated from a set of previously identified instances of the binding-site type on known protein structures. The scoring function can easily incorporate different types of information useful in localization, thus increasing the applicability and accuracy of the approach. To test the method, 1008 known protein structures were split into 20 different groups according to the type of the bound ligand. For nonsugar ligands, such as various nucleotides, binding sites were correctly identified in 55% - 73% of the cases. The method is completely automated (http://salilab.org/patcher) and can be applied on a large scale in a structural genomics setting.
引用
收藏
页码:2366 / 2380
页数:15
相关论文
共 68 条
[1]   Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information [J].
Ahmad, S ;
Gromiha, MM ;
Sarai, A .
BIOINFORMATICS, 2004, 20 (04) :477-486
[2]  
Alberts B., 2002, Molecular Biology of The Cell, V4th
[3]   Automated structure-based prediction of functional sites in proteins: Applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking [J].
Aloy, P ;
Querol, E ;
Aviles, FX ;
Sternberg, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 311 (02) :395-408
[4]   ConSurf: An algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information [J].
Armon, A ;
Graur, D ;
Ben-Tal, N .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (01) :447-463
[5]   A GRAPH-THEORETIC APPROACH TO THE IDENTIFICATION OF 3-DIMENSIONAL PATTERNS OF AMINO-ACID SIDE-CHAINS IN PROTEIN STRUCTURES [J].
ARTYMIUK, PJ ;
POIRRETTE, AR ;
GRINDLEY, HM ;
RICE, DW ;
WILLETT, P .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 243 (02) :327-344
[6]   Definitions of enzyme function for the structural genomics era [J].
Babbitt, PC .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2003, 7 (02) :230-237
[7]  
BAIROCH A, 1994, NUCLEIC ACIDS RES, V22, P3583
[8]   Predicting function: From genes to genomes and back [J].
Bork, P ;
Dandekar, T ;
Diaz-Lazcoz, Y ;
Eisenhaber, F ;
Huynen, M ;
Yuan, YP .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 283 (04) :707-725
[9]   Fast prediction and visualization of protein binding pockets with PASS [J].
Brady, GP ;
Stouten, PFW .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2000, 14 (04) :383-401
[10]   Target selection for structural genomics [J].
Brenner, SE .
NATURE STRUCTURAL BIOLOGY, 2000, 7 (Suppl 11) :967-969