PRINCESS, a protein interaction confidence evaluation system with multiple data sources

被引:49
作者
Li, Dong [1 ]
Liu, Wanlin [1 ]
Liu, Zhongyang [1 ]
Wang, Jian [1 ]
Liu, Qijun [1 ]
Zhu, Yunping [1 ]
He, Fuchu [1 ]
机构
[1] Beijing Inst Radiat Med, Beijing Proteome Res Ctr, State Key Lab Proteom, Beijing 100850, Peoples R China
关键词
D O I
10.1074/mcp.M700287-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Advances in proteomics technologies have enabled novel protein interactions to be detected at high speed, but they come at the expense of relatively low quality. Therefore, a crucial step in utilizing the high throughput protein interaction data is evaluating their confidence and then separating the subsets of reliable interactions from the background noise for further analyses. Using Bayesian network approaches, we combine multiple heterogeneous biological evidences, including model organism protein-protein interaction, interaction domain, functional annotation, gene expression, genome context, and network topology structure, to assign reliability to the human protein-protein interactions identified by high throughput experiments. This method shows high sensitivity and specificity to predict true interactions from the human high throughput protein-protein interaction data sets. This method has been developed into an on-line confidence scoring system specifically for the human high throughput protein-protein interactions. Users may submit their protein-protein interaction data on line, and the detailed information about the supporting evidence for query interactions together with the confidence scores will be returned. The Web interface of PRINCESS (protein interaction confidence evaluation system with multiple data sources) is available at the website of China Human Proteome Organisation.
引用
收藏
页码:1043 / 1052
页数:10
相关论文
共 50 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]   Gaining confidence in high-throughput protein interaction networks [J].
Bader, JS ;
Chaudhuri, A ;
Rothberg, JM ;
Chant, J .
NATURE BIOTECHNOLOGY, 2004, 22 (01) :78-85
[3]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[4]   Molecular networks: The top-down view [J].
Bray, D .
SCIENCE, 2003, 301 (5641) :1864-1865
[5]   Protein interactions - Two methods for assessment of the reliability of high throughput observations [J].
Deane, CM ;
Salwinski, L ;
Xenarios, I ;
Eisenberg, D .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (05) :349-356
[6]   What is Bayesian statistics? [J].
Eddy, SR .
NATURE BIOTECHNOLOGY, 2004, 22 (09) :1177-1178
[7]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[8]  
Frank E, 2005, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, P1305, DOI 10.1007/0-387-25465-X_62
[9]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[10]   A protein interaction map of Drosophila melanogaster [J].
Giot, L ;
Bader, JS ;
Brouwer, C ;
Chaudhuri, A ;
Kuang, B ;
Li, Y ;
Hao, YL ;
Ooi, CE ;
Godwin, B ;
Vitols, E ;
Vijayadamodar, G ;
Pochart, P ;
Machineni, H ;
Welsh, M ;
Kong, Y ;
Zerhusen, B ;
Malcolm, R ;
Varrone, Z ;
Collis, A ;
Minto, M ;
Burgess, S ;
McDaniel, L ;
Stimpson, E ;
Spriggs, F ;
Williams, J ;
Neurath, K ;
Ioime, N ;
Agee, M ;
Voss, E ;
Furtak, K ;
Renzulli, R ;
Aanensen, N ;
Carrolla, S ;
Bickelhaupt, E ;
Lazovatsky, Y ;
DaSilva, A ;
Zhong, J ;
Stanyon, CA ;
Finley, RL ;
White, KP ;
Braverman, M ;
Jarvie, T ;
Gold, S ;
Leach, M ;
Knight, J ;
Shimkets, RA ;
McKenna, MP ;
Chant, J ;
Rothberg, JM .
SCIENCE, 2003, 302 (5651) :1727-1736