Integrating human omics data to prioritize candidate genes

被引:26
作者
Chen, Yong [1 ,2 ,3 ]
Wu, Xuebing [4 ,5 ]
Jiang, Rui [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Automat, MOE Key Lab Bioinformat, Bioinformat Div, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Ctr Synthet & Syst Biol, TNLIST, Beijing 100084, Peoples R China
[3] Chinese Acad Sci, Inst Biophys, Beijing 100101, Peoples R China
[4] MIT, David H Koch Inst Integrat Canc Res, Cambridge, MA 02139 USA
[5] MIT, Computat & Syst Biol Grad Program, Cambridge, MA 02139 USA
来源
BMC MEDICAL GENOMICS | 2013年 / 6卷
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
GENOME-WIDE ASSOCIATION; NEUROPEPTIDE-Y; DISEASE GENES; SEMANTIC SIMILARITY; INSULIN-RESISTANCE; OBESITY; INTERACTOME; NETWORK; PHENOME; THERAPY;
D O I
10.1186/1755-8794-6-57
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. Methods: With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. Results: We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. Conclusion: The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases.
引用
收藏
页数:12
相关论文
共 72 条
[61]   A text-mining analysis of the human phenome [J].
van Driel, MA ;
Bruggeman, J ;
Vriend, G ;
Brunner, HG ;
Leunissen, JA .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2006, 14 (05) :535-542
[62]   Associating Genes and Protein Complexes with Disease via Network Propagation [J].
Vanunu, Oron ;
Magger, Oded ;
Ruppin, Eytan ;
Shlomi, Tomer ;
Sharan, Roded .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (01)
[63]   The road to modularity [J].
Wagner, Gunter P. ;
Pavlicev, Mihaela ;
Cheverud, James M. .
NATURE REVIEWS GENETICS, 2007, 8 (12) :921-931
[64]   A new method to measure the semantic similarity of GO terms [J].
Wang, James Z. ;
Du, Zhidian ;
Payattakool, Rapeeporn ;
Yu, Philip S. ;
Chen, Chin-Fu .
BIOINFORMATICS, 2007, 23 (10) :1274-1281
[65]   Six new loci associated with body mass index highlight a neuronal influence on body weight regulation [J].
Willer, Cristen J. ;
Speliotes, Elizabeth K. ;
Loos, Ruth J. F. ;
Li, Shengxu ;
Lindgren, Cecilia M. ;
Heid, Iris M. ;
Berndt, Sonja I. ;
Elliott, Amanda L. ;
Jackson, Anne U. ;
Lamina, Claudia ;
Lettre, Guillaume ;
Lim, Noha ;
Lyon, Helen N. ;
McCarroll, Steven A. ;
Papadakis, Konstantinos ;
Qi, Lu ;
Randall, Joshua C. ;
Roccasecca, Rosa Maria ;
Sanna, Serena ;
Scheet, Paul ;
Weedon, Michael N. ;
Wheeler, Eleanor ;
Zhao, Jing Hua ;
Jacobs, Leonie C. ;
Prokopenko, Inga ;
Soranzo, Nicole ;
Tanaka, Toshiko ;
Timpson, Nicholas J. ;
Almgren, Peter ;
Bennett, Amanda ;
Bergman, Richard N. ;
Bingham, Sheila A. ;
Bonnycastle, Lori L. ;
Brown, Morris ;
Burtt, Noel L. P. ;
Chines, Peter ;
Coin, Lachlan ;
Collins, Francis S. ;
Connell, John M. ;
Cooper, Cyrus ;
Smith, George Davey ;
Dennison, Elaine M. ;
Deodhar, Parimal ;
Elliott, Paul ;
Erdos, Michael R. ;
Estrada, Karol ;
Evans, David M. ;
Gianniny, Lauren ;
Gieger, Christian ;
Gillson, Christopher J. .
NATURE GENETICS, 2009, 41 (01) :25-34
[66]   The genomic landscapes of human breast and colorectal cancers [J].
Wood, Laura D. ;
Parsons, D. Williams ;
Jones, Sian ;
Lin, Jimmy ;
Sjoblom, Tobias ;
Leary, Rebecca J. ;
Shen, Dong ;
Boca, Simina M. ;
Barber, Thomas ;
Ptak, Janine ;
Silliman, Natalie ;
Szabo, Steve ;
Dezso, Zoltan ;
Ustyanksky, Vadim ;
Nikolskaya, Tatiana ;
Nikolsky, Yuri ;
Karchin, Rachel ;
Wilson, Paul A. ;
Kaminker, Joshua S. ;
Zhang, Zemin ;
Croshaw, Randal ;
Willis, Joseph ;
Dawson, Dawn ;
Shipitsin, Michail ;
Willson, James K. V. ;
Sukumar, Saraswati ;
Polyak, Kornelia ;
Park, Ben Ho ;
Pethiyagoda, Charit L. ;
Pant, P. V. Krishna ;
Ballinger, Dennis G. ;
Sparks, Andrew B. ;
Hartigan, James ;
Smith, Douglas R. ;
Suh, Erick ;
Papadopoulos, Nickolas ;
Buckhaults, Phillip ;
Markowitz, Sanford D. ;
Parmigiani, Giovanni ;
Kinzler, Kenneth W. ;
Velculescu, Victor E. ;
Vogelstein, Bert .
SCIENCE, 2007, 318 (5853) :1108-1113
[67]   Network-based global inference of human disease genes [J].
Wu, Xuebing ;
Jiang, Rui ;
Zhang, Michael Q. ;
Li, Shao .
MOLECULAR SYSTEMS BIOLOGY, 2008, 4 (1)
[68]   Align human interactome with phenome to identify causative genes and networks underlying disease families [J].
Wu, Xuebing ;
Liu, Qifang ;
Jiang, Rui .
BIOINFORMATICS, 2009, 25 (01) :98-104
[69]   Validation of candidate causal genes for obesity that affect shared metabolic pathways and networks [J].
Yang, Xia ;
Deignan, Joshua L. ;
Qi, Hongxiu ;
Zhu, Jun ;
Qian, Su ;
Zhong, Judy ;
Torosyan, Gevork ;
Majid, Sana ;
Falkard, Brie ;
Kleinhanz, Robert R. ;
Karlsson, Jenny ;
Castellani, Lawrence W. ;
Mumick, Sheena ;
Wang, Kai ;
Xie, Tao ;
Coon, Michael ;
Zhang, Chunsheng ;
Estrada-Smith, Daria ;
Farber, Charles R. ;
Wang, Susanna S. ;
Van Nas, Atila ;
Ghazalpour, Anatole ;
Zhang, Bin ;
MacNeil, Douglas J. ;
Lamb, John R. ;
Dipple, Katrina M. ;
Reitman, Marc L. ;
Mehrabian, Margarete ;
Lum, Pek Y. ;
Schadt, Eric E. ;
Lusis, Aldons J. ;
Drake, Thomas A. .
NATURE GENETICS, 2009, 41 (04) :415-423
[70]   DomainRBF: a Bayesian regression approach to the prioritization of candidate domains for complex diseases [J].
Zhang, Wangshu ;
Chen, Yong ;
Sun, Fengzhu ;
Jiang, Rui .
BMC SYSTEMS BIOLOGY, 2011, 5