Fold independent structural comparisons of protein-ligand binding sites for exploring functional relationships

被引:80
作者
Gold, ND [1 ]
Jackson, RM [1 ]
机构
[1] Univ Leeds, Inst Mol & Cellular Biol, Leeds LS2 9JT, W Yorkshire, England
基金
英国生物技术与生命科学研究理事会;
关键词
geometric hashing; SitesBase; structural genomics; 3D structure comparison;
D O I
10.1016/j.jmb.2005.11.044
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The rapid growth in protein structural data and the emergence of structural genomics projects have increased the need for automatic structure analysis and tools for function prediction. Small molecule recognition is critical to the function of many proteins; therefore, determination of ligand binding site similarity is important for understanding ligand interactions and may allow their functional classification. Here, we present a binding sites database (SitesBase) that given a known protein-ligand binding site allows rapid retrieval of other binding sites with similar structure independent of overall sequence or fold similarity. However, each match is also annotated with sequence similarity and fold information to aid interpretation of structure and functional similarity. Similarity in ligand binding sites can indicate common binding modes and recognition of similar molecules, allowing potential inference of function for an uncharacterised protein or providing additional evidence of common function where sequence or fold similarity is already known. Alternatively, the resource can provide valuable information for detailed studies of molecular recognition including structure-based ligand design and in understanding ligand cross-reactivity. Here, we show examples of atomic similarity between superfamily or more distant fold relatives as well as between seemingly unrelated proteins. Assignment of unclassified proteins to structural superfamiles is also undertaken and in most cases substantiates assignments made using sequence similarity. Correct assignment is also possible where sequence similarity fails to find significant matches, illustrating the potential use of binding site comparisons for newly determined proteins. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1112 / 1124
页数:13
相关论文
共 47 条
[1]   Iterated profile searches with PSI-BLAST - a tool for discovery in protein databases [J].
Altschul, SF ;
Koonin, EV .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (11) :444-447
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   A GRAPH-THEORETIC APPROACH TO THE IDENTIFICATION OF 3-DIMENSIONAL PATTERNS OF AMINO-ACID SIDE-CHAINS IN PROTEIN STRUCTURES [J].
ARTYMIUK, PJ ;
POIRRETTE, AR ;
GRINDLEY, HM ;
RICE, DW ;
WILLETT, P .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 243 (02) :327-344
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]   Inferring functional relationships of proteins from local sequence and spatial surface patterns [J].
Binkowski, TA ;
Adamian, L ;
Liang, J .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (02) :505-526
[6]   E-MSD: the European Bioinformatics Institute Macromolecular Structure Database [J].
Boutselakis, H ;
Dimitropoulos, D ;
Fillon, J ;
Golovin, A ;
Henrick, K ;
Hussain, A ;
Ionides, J ;
John, M ;
Keller, PA ;
Krissinel, E ;
McNeil, P ;
Naim, A ;
Newman, R ;
Oldfield, T ;
Pineda, J ;
Rachedi, A ;
Copeland, J ;
Sitnov, A ;
Sobhany, S ;
Suarez-Uruena, A ;
Swaminathan, J ;
Tagari, M ;
Tate, J ;
Tromm, S ;
Velankar, S ;
Vranken, W .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :458-462
[7]   Towards a structural classification of phosphate binding sites in protein-nucleotide complexes: An automated all-against-all structural comparison using geometric matching [J].
Brakoulias, A ;
Jackson, RM .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 56 (02) :250-260
[8]   FINDING ALL CLIQUES OF AN UNDIRECTED GRAPH [H] [J].
BRON, C ;
KERBOSCH, J .
COMMUNICATIONS OF THE ACM, 1973, 16 (09) :575-577
[9]  
Carugo O, 1997, PROTEINS, V28, P29, DOI 10.1002/(SICI)1097-0134(199705)28:1<29::AID-PROT3>3.3.CO
[10]  
2-C