GRIP: A web-based system for constructing Gold Standard datasets for protein-protein interaction prediction

被引:22
作者
Browne, Fiona [1 ]
Wang, Haiying [1 ]
Zheng, Huiru [1 ]
Azuaje, Francisco [2 ]
机构
[1] Univ Ulster, Sch Comp & Math, Jordanstown, North Ireland
[2] Res Ctr Publ Hlth CRP Sante, Lab Cardiovasc Res, L-1445 Strassen, Luxembourg
关键词
D O I
10.1186/1751-0473-4-2
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Information about protein interaction networks is fundamental to understanding protein function and cellular processes. Interaction patterns among proteins can suggest new drug targets and aid in the design of new therapeutic interventions. Efforts have been made to map interactions on a proteomic-wide scale using both experimental and computational techniques. Reference datasets that contain known interacting proteins (positive cases) and non-interacting proteins (negative cases) are essential to support computational prediction and validation of protein-protein interactions. Information on known interacting and non interacting proteins are usually stored within databases. Extraction of these data can be both complex and time consuming. Although, the automatic construction of reference datasets for classification is a useful resource for researchers no public resource currently exists to perform this task. Results: GRIP (Gold Reference dataset constructor from Information on Protein complexes) is a web-based system that provides researchers with the functionality to create reference datasets for protein-protein interaction prediction in Saccharomyces cerevisiae. Both positive and negative cases for a reference dataset can be extracted, organised and downloaded by the user. GRIP also provides an upload facility whereby users can submit proteins to determine protein complex membership. A search facility is provided where a user can search for protein complex information in Saccharomyces cerevisiae. Conclusion: GRIP is developed to retrieve information on protein complex, cellular localisation, and physical and genetic interactions in Saccharomyces cerevisiae. Manual construction of reference datasets can be a time consuming process requiring programming knowledge. GRIP simplifies and speeds up this process by allowing users to automatically construct reference datasets.
引用
收藏
页数:7
相关论文
共 10 条
[1]  
Ashburner M, 2006, NUCLEIC ACIDS RES, V25, P25
[2]   Choosing negative examples for the prediction of protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BMC BIOINFORMATICS, 2006, 7 (Suppl 1)
[3]   The BioGRID interaction database:: 2008 update [J].
Breitkreutz, Bobby-Joe ;
Stark, Chris ;
Reguly, Teresa ;
Boucher, Lorrie ;
Breitkreutz, Ashton ;
Livstone, Michael ;
Oughtred, Rose ;
Lackner, Daniel H. ;
Bahler, Jurg ;
Wood, Valerie ;
Dolinski, Kara ;
Tyers, Mike .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D637-D640
[4]  
Browne F, 2007, PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, P1365
[5]   Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae [J].
Collins, Sean R. ;
Kemmeren, Patrick ;
Zhao, Xue-Chu ;
Greenblatt, Jack F. ;
Spencer, Forrest ;
Holstege, Frank C. P. ;
Weissman, Jonathan S. ;
Krogan, Nevan J. .
MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (03) :439-450
[6]   A skeleton of the human protein interactome [J].
Ghavidel, A ;
Cagney, G ;
Emili, A .
CELL, 2005, 122 (06) :830-832
[7]   A Bayesian networks approach for predicting protein-protein interactions from genomic data [J].
Jansen, R ;
Yu, HY ;
Greenbaum, D ;
Kluger, Y ;
Krogan, NJ ;
Chung, SB ;
Emili, A ;
Snyder, M ;
Greenblatt, JF ;
Gerstein, M .
SCIENCE, 2003, 302 (5644) :449-453
[8]   Assessing the limits of genomic data integration for predicting protein networks [J].
Lu, LJ ;
Xia, Y ;
Paccanaro, A ;
Yu, HY ;
Gerstein, M .
GENOME RESEARCH, 2005, 15 (07) :945-953
[9]   MIPS:: analysis and annotation of proteins from whole genomes [J].
Mewes, HW ;
Amid, C ;
Arnold, R ;
Frishman, D ;
Güldener, U ;
Mannhaupt, G ;
Münsterkötter, M ;
Pagel, P ;
Strack, N ;
Stümpflen, V ;
Warfsmann, J ;
Ruepp, A .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D41-D44
[10]   Discovery of biological networks from diverse functional genomic data [J].
Myers, Chad L. ;
Robson, Drew ;
Wible, Adam ;
Hibbs, Matthew A. ;
Chiriac, Camelia ;
Theesfeld, Chandra L. ;
Dolinski, Kara ;
Troyanskaya, Olga G. .
GENOME BIOLOGY, 2005, 6 (13)