Enhanced functional annotation of protein sequences via the use of structural descriptors

被引:38
作者
Di Gennaro, JA
Siew, N
Hoffman, BT
Zhang, L
Skolnick, J
Neilson, LI
Fetrow, JS
机构
[1] GeneFormat Inc, San Diego, CA 92121 USA
[2] Scripps Res Inst, Dept Mol Biol, La Jolla, CA 92037 USA
[3] Danforth Plant Sci Ctr, Lab Computat Genomes, St Louis, MO 63141 USA
关键词
Bacillus subtilis; bioinformatics; disulfide oxidoreductase; Drosophila melanogaster; FFF (fuzzy functional form); functional annotation; protein function prediction; protein tyrosine phosphatase; structural genomics;
D O I
10.1006/jsbi.2001.4391
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In order to circumvent limitations of sequence based methods in the process of making functional predictions for proteins, we have developed a methodology that uses a sequence-to-structure-to-function paradigm. First, an approximate three-dimensional structure is predicted. Then, a three-dimensional descriptor of the functional site, termed a Fuzzy Functional Form, or FFF, is used to screen the structure for the presence of the functional site of interest (Fetrow et al., 1998; Fetrow and Skolnick, 1998). Previously, a disulfide oxidoreductase FFF was developed and applied to predicted structures obtained from a small structural database. Here, using a substantially larger structural database, we expand the analysis of the disulfide oxidoreductase FFF to the B. subtilis genome. To ascertain the performance of the FFF, its results are compared to those obtained using both the sequence alignment method BLAST and three local sequence motif databases: PRINTS, Prosite, and Blocks. The FFF method is then compared in detail to Blocks and it is shown that the FFF is more flexible and sensitive in finding a specific function in a set of unknown proteins. In addition, the estimated false positive rate of function prediction is significantly lower using the FFF structural motif, rather than the standard sequence motif methods. We also present a second FFF and describe a specific example of the results of its whole-genome application to D. melanogaster using a newer threading algorithm. Our results from all of these studies indicate that the addition of three-dimensional structural in-formation adds significant value in the prediction of biochemical function of genomic sequences. (C) 2001 Academic Press.
引用
收藏
页码:232 / 245
页数:14
相关论文
共 67 条
[61]   Crystal structures of reduced, oxidized, and mutated human thioredoxins: Evidence for a regulatory homodimer [J].
Weichsel, A ;
Gasdaska, JR ;
Powis, G ;
Montfort, WR .
STRUCTURE, 1996, 4 (06) :735-751
[62]  
XIA TH, 1992, PROTEIN SCI, V1, P310
[63]  
Xu HM, 1999, NAT STRUCT BIOL, V6, P750
[64]   Crystal structure of the dual specificity protein phosphatase VHR [J].
Yuvaniyama, J ;
Denu, JM ;
Dixon, JE ;
Saper, MA .
SCIENCE, 1996, 272 (5266) :1328-1331
[65]   Functional analysis of the Escherichia coli genome for members of the α/β hydrolase family [J].
Zhang, L ;
Godzik, A ;
Skolnick, J ;
Fetrow, JS .
FOLDING & DESIGN, 1998, 3 (06) :535-548
[66]   Crystal structure of a human low molecular weight phosphotyrosyl phosphatase - Implications for substrate specificity [J].
Zhang, M ;
Stauffacher, CV ;
Lin, DY ;
Van Etten, RL .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (34) :21714-21720
[67]   Protein sequence similarity searches using patterns as seeds [J].
Zhang, Z ;
Schaffer, AA ;
Miller, W ;
Madden, TL ;
Lipman, DJ ;
Koonin, EV ;
Altschul, SF .
NUCLEIC ACIDS RESEARCH, 1998, 26 (17) :3986-3990