AVID: An integrative framework for discovering functional relationships among proteins

Times cited: 29
Authors
Jiang, TJ [1]
Keating, AE [1]
Affiliations
[1] MIT, Dept Biol, Cambridge, MA 02139 USA
DOI
10.1186/1471-2105-6-136
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Determining the functions of uncharacterized proteins is one of the most pressing problems in the post-genomic era. Large-scale protein-protein interaction assays, global mRNA expression analyses and systematic protein localization studies provide experimental information that can be used for this purpose. The data from such experiments contain many false positives and false negatives, but can be processed using computational methods to provide reliable information about protein-protein relationships and protein function. An outstanding and important goal is to predict detailed functional annotation for all uncharacterized proteins that is reliable enough to effectively guide experiments.
Results: We present AVID, a computational method that uses a multi-stage learning framework to integrate experimental results with sequence information, generating networks reflecting functional similarities among proteins. We illustrate use of the networks by making predictions of detailed Gene Ontology (GO) annotations in three categories: molecular function, biological process, and cellular component. Applied to the yeast Saccharomyces cerevisiae, AVID provides 37,451 pair-wise functional linkages between 4,191 proteins. These relationships are approximately 65-78% accurate, as assessed by cross-validation testing. Assignments of highly detailed functional descriptors to proteins, based on the networks, are estimated to be approximately 67% accurate for GO categories describing molecular function and cellular component and approximately 52% accurate for terms describing biological process. The predictions cover 1,490 proteins with no previous annotation in GO and also assign more detailed functions to many proteins annotated only with less descriptive terms. Predictions made by AVID are largely distinct from those made by other methods. Out of 37,451 predicted pair-wise relationships, the greatest number shared in common with another method is 3,413.
Conclusion: AVID provides three networks reflecting functional associations among proteins. We use these networks to generate new, highly detailed functional predictions for roughly half of the yeast proteome that are reliable enough to drive targeted experimental investigations. The predictions suggest many specific, testable hypotheses. All of the data are available as downloadable files as well as through an interactive website at http://web.mit.edu/biology/keating/AVID. Thus, AVID will be a valuable resource for experimental biologists.
Pages: 13