Functional analysis of the Escherichia coli genome for members of the α/β hydrolase family

Cited by: 27
Authors
Zhang, L [1]
Godzik, A [1]
Skolnick, J [1]
Fetrow, JS [1]
Affiliations
[1] Scripps Res Inst, Dept Mol Biol, La Jolla, CA 92037 USA
Source
FOLDING & DESIGN | 1998 / Vol. 3 / Issue 06
Keywords
fold prediction; functional genomics; function prediction; hydrolase family
DOI
10.1016/S1359-0278(98)00069-8
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Background: Database-searching methods based on sequence similarity have become the most commonly used tools for characterizing newly sequenced proteins. Due to the often underestimated functional diversity in protein families and superfamilies, however, it is difficult to make the characterization specific and accurate. In this work, we have extended a method for active-site identification from predicted protein structures.
Results: The structural conservation and variation of the active sites of the alpha/beta hydrolases with known structures were studied. The similarities were incorporated into a three-dimensional motif that specifies essential requirements for the enzymatic functions. A threading algorithm was used to align 651 Escherichia coli open reading frames (ORFs) to one of the members of the alpha/beta hydrolase fold family. These ORFs were then screened according to our three-dimensional motif and with an extra requirement that demands conservation of the key active-site residues among the proteins that bear significant sequence similarity to the ORFs. 17 ORFs from E. coli were predicted to have hydrolase activity and their putative active-site residues were identified. Most were in agreement with the experiments and results of other database-searching methods. The study further suggests that YHET_ECOLI, a hypothetical protein classified as a member of the UPF0017 family (an uncharacterized protein family), bears all the hallmarks of the alpha/beta hydrolase family.
Conclusions: The novel feature of our method is that it uses three-dimensional structural information for function prediction. The results demonstrate the importance and necessity of such a method to fill the gap between sequence alignment and function prediction; furthermore, the method provides a way to verify the structure predictions, which enables an expansion of the applicable scope of the threading algorithms.
Pages: 535-548
Page count: 14
Related papers
46 in total
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]   PRINTS - A PROTEIN MOTIF FINGERPRINT DATABASE [J].
ATTWOOD, TK ;
BECK, ME .
PROTEIN ENGINEERING, 1994, 7 (07) :841-848
[4]   Novel developments with the PRINTS protein fingerprint database [J].
Attwood, TK ;
Beck, ME ;
Bleasby, AJ ;
Degtyarenko, K ;
Michie, AD ;
Parry-Smith, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (01) :212-216
[5]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
Collado-Vides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[6]   aCHEdb: the database system for ESTHER, the α/β fold family of proteins and the Cholinesterase gene server [J].
Cousin, X ;
Hotelier, T ;
Giles, K ;
Toutant, JP ;
Chatonnet, A .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :226-228
[7]   THE CRYSTAL AND MOLECULAR-STRUCTURE OF THE RHIZOMUCOR-MIEHEI TRIACYLGLYCERIDE LIPASE AT 1.9-ANGSTROM RESOLUTION [J].
DEREWENDA, ZS ;
DEREWENDA, U ;
DODSON, GG .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 227 (03) :818-839
[8]   MOLECULAR EVOLUTION AND DOMAIN-STRUCTURE OF PLASMINOGEN-RELATED GROWTH-FACTORS (HGF/SF AND HGF1/MSP) [J].
DONATE, LE ;
GHERARDI, E ;
SRINIVASAN, N ;
SOWDHAMINI, R ;
APARICIO, S ;
BLUNDELL, TL .
PROTEIN SCIENCE, 1994, 3 (12) :2378-2394
[9]   Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases [J].
Fetrow, JS ;
Skolnick, J .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 281 (05) :949-968
[10]   Assigning amino acid sequences to 3-dimensional protein folds [J].
Fischer, D ;
Rice, D ;
Bowie, JU ;
Eisenberg, D .
FASEB JOURNAL, 1996, 10 (01) :126-136