Associating Genes and Protein Complexes with Disease via Network Propagation

被引:657
作者
Vanunu, Oron [1 ]
Magger, Oded [1 ]
Ruppin, Eytan [1 ]
Shlomi, Tomer [2 ]
Sharan, Roded [1 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
关键词
INTERACTOME; PRIORITIZATION; RESOURCE; DATABASE; PATHWAY; PHENOME;
D O I
10.1371/journal.pcbi.1000641
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A fundamental challenge in human health is the identification of disease-causing genes. Recently, several studies have tackled this challenge via a network-based approach, motivated by the observation that genes causing the same or similar diseases tend to lie close to one another in a network of protein-protein or functional interactions. However, most of these approaches use only local network information in the inference process and are restricted to inferring single gene associations. Here, we provide a global, network-based method for prioritizing disease genes and inferring protein complex associations, which we call PRINCE. The method is based on formulating constraints on the prioritization function that relate to its smoothness over the network and usage of prior information. We exploit this function to predict not only genes but also protein complex associations with a disease of interest. We test our method on gene-disease association data, evaluating both the prioritization achieved and the protein complexes inferred. We show that our method outperforms extant approaches in both tasks. Using data on 1,369 diseases from the OMIM knowledgebase, our method is able (in a cross validation setting) to rank the true causal gene first for 34% of the diseases, and infer 139 disease-related complexes that are highly coherent in terms of the function, expression and conservation of their member proteins. Importantly, we apply our method to study three multi-factorial diseases for which some causal genes have been found already: prostate cancer, alzheimer and type 2 diabetes mellitus. PRINCE's predictions for these diseases highly match the known literature, suggesting several novel causal genes and protein complexes for further investigation.
引用
收藏
页数:9
相关论文
共 32 条
[1]  
[Anonymous], 2003, P NIPS 2003 VANC BC
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]  
Becker KG, 2004, NAT GENET, V36, P431, DOI 10.1038/ng0504-431
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]   From syndrome families to functional genomics [J].
Brunner, HG ;
van Driel, MA .
NATURE REVIEWS GENETICS, 2004, 5 (07) :545-551
[6]   The Fanconi Anemia/BRCA Signaling Pathway Disruption in Cisplatin-Sensitive Ovarian Cancers [J].
D'Andrea, Alan D. .
CELL CYCLE, 2003, 2 (04) :290-292
[7]   An efficient algorithm for large-scale detection of protein families [J].
Enright, AJ ;
Van Dongen, S ;
Ouzounis, CA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (07) :1575-1584
[8]   Large-scale mapping of human protein-protein interactions by mass spectrometry [J].
Ewing, Rob M. ;
Chu, Peter ;
Elisma, Fred ;
Li, Hongyan ;
Taylor, Paul ;
Climie, Shane ;
McBroom-Cerajewski, Linda ;
Robinson, Mark D. ;
O'Connor, Liam ;
Li, Michael ;
Taylor, Rod ;
Dharsee, Moyez ;
Ho, Yuen ;
Heilbut, Adrian ;
Moore, Lynda ;
Zhang, Shudong ;
Ornatsky, Olga ;
Bukhman, Yury V. ;
Ethier, Martin ;
Sheng, Yinglun ;
Vasilescu, Julian ;
Abu-Farha, Mohamed ;
Lambert, Jean-Philippe ;
Duewel, Henry S. ;
Stewart, Ian I. ;
Kuehl, Bonnie ;
Hogue, Kelly ;
Colwill, Karen ;
Gladwish, Katharine ;
Muskat, Brenda ;
Kinach, Robert ;
Adams, Sally-Lin ;
Moran, Michael F. ;
Morin, Gregg B. ;
Topaloglou, Thodoros ;
Figeys, Daniel .
MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
[9]   Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes [J].
Franke, Lude ;
van Bakel, Harm ;
Fokkens, Like ;
de Jong, Edwin D. ;
Egmont-Petersen, Michael ;
Wijmenga, Cisca .
AMERICAN JOURNAL OF HUMAN GENETICS, 2006, 78 (06) :1011-1025
[10]   Analysis of protein sequence and interaction data for candidate disease gene prediction [J].
George, Richard A. ;
Liu, Jason Y. ;
Feng, Lina L. ;
Bryson-Richardson, Robert J. ;
Fatkin, Diane ;
Wouters, Merridee A. .
NUCLEIC ACIDS RESEARCH, 2006, 34 (19)