A new bioinformatic approach to detect common 3D sites in protein structures

被引:120
作者
Jambon, M
Imberty, A
Deléage, G
Geourjon, C
机构
[1] Inst Biol & Chim Prot, Lab Bioinformat & RMN Struct, F-69367 Lyon, France
[2] Univ Grenoble 1, Ctr Rech Macromol Vegetales, CERMAV, Grenoble, France
关键词
bioinformatics; 3D structure of proteins; user-defined chemical groups; detection of similar 3D functional sites;
D O I
10.1002/prot.10339
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
An innovative bioinformatic method has been designed and implemented to detect similar three-dimensional (3D) sites in proteins. This approach allows the comparison of protein structures or substructures and detects local spatial similarities: this method is completely independent from the amino acid sequence and from the backbone structure. In contrast to already existing tools, the basis for this method is a representation of the protein structure by a set of stereochemical groups that are defined independently from the notion of amino acid. An efficient heuristic for finding similarities that uses graphs of triangles of chemical groups to represent the protein structures has been developed. The implementation of this heuristic constitutes a software named SuMo (Surfing the Molecules), which allows the dynamic definition of chemical groups, the selection of sites in the proteins, and the management and screening of databases. To show the relevance of this approach, we focused on two extreme examples illustrating convergent and divergent evolution. In two unrelated serine proteases, SuMo detects one common site, which corresponds to the catalytic triad. In the legume lectins family composed of >100 structures that share similar sequences and folds but may have lost their ability to bind a carbohydrate molecule, SuMo discriminates between functional and non-functional lectins with a selectivity of 96%. The time needed for searching a given site in a protein structure is typically 0.1 s on a PIII 800MHz/Linux computer; thus, in further studies, SuMo will be used to screen the PDB. (C) 2003 Wiley-Liss, Inc.
引用
收藏
页码:137 / 145
页数:9
相关论文
共 31 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   PROSITE - A DICTIONARY OF SITES AND PATTERNS IN PROTEINS [J].
BAIROCH, A .
NUCLEIC ACIDS RESEARCH, 1991, 19 :2241-2245
[3]  
BLANCHET C, 1999, LOGICIEL MPSA RESSOU
[4]   The structural features of concanavalin A governing non-proline peptide isomerization [J].
Bouckaert, J ;
Dewallef, Y ;
Poortmans, F ;
Wyns, L ;
Loris, R .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (26) :19778-19787
[5]  
CHAILLOUX E, 2000, DEV APPL OBJECTIVE C
[6]   AN EFFICIENT AUTOMATED COMPUTER VISION BASED TECHNIQUE FOR DETECTION OF 3-DIMENSIONAL STRUCTURAL MOTIFS IN PROTEINS [J].
FISCHER, D ;
BACHAR, O ;
NUSSINOV, R ;
WOLFSON, H .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1992, 9 (04) :769-789
[7]   PROFILE ANALYSIS - DETECTION OF DISTANTLY RELATED PROTEINS [J].
GRIBSKOV, M ;
MCLACHLAN, AD ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (13) :4355-4358
[8]   Touring protein fold space with Dali/FSSP [J].
Holm, L ;
Sander, C .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :316-319
[9]   PROTEIN-STRUCTURE COMPARISON BY ALIGNMENT OF DISTANCE MATRICES [J].
HOLM, L ;
SANDER, C .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 233 (01) :123-138
[10]   Dali/FSSP classification of three-dimensional protein folds [J].
Holm, L ;
Sander, C .
NUCLEIC ACIDS RESEARCH, 1997, 25 (01) :231-234