Integrating human omics data to prioritize candidate genes

被引:26
作者
Chen, Yong [1 ,2 ,3 ]
Wu, Xuebing [4 ,5 ]
Jiang, Rui [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Automat, MOE Key Lab Bioinformat, Bioinformat Div, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Ctr Synthet & Syst Biol, TNLIST, Beijing 100084, Peoples R China
[3] Chinese Acad Sci, Inst Biophys, Beijing 100101, Peoples R China
[4] MIT, David H Koch Inst Integrat Canc Res, Cambridge, MA 02139 USA
[5] MIT, Computat & Syst Biol Grad Program, Cambridge, MA 02139 USA
来源
BMC MEDICAL GENOMICS | 2013年 / 6卷
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
GENOME-WIDE ASSOCIATION; NEUROPEPTIDE-Y; DISEASE GENES; SEMANTIC SIMILARITY; INSULIN-RESISTANCE; OBESITY; INTERACTOME; NETWORK; PHENOME; THERAPY;
D O I
10.1186/1755-8794-6-57
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. Methods: With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. Results: We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. Conclusion: The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases.
引用
收藏
页数:12
相关论文
共 72 条
[1]   Speeding disease gene discovery by sequence based candidate prioritization [J].
Adie, EA ;
Adams, RR ;
Evans, KL ;
Porteous, DJ ;
Pickard, BS .
BMC BIOINFORMATICS, 2005, 6 (1)
[2]   Gene prioritization through genomic data fusion [J].
Aerts, S ;
Lambrechts, D ;
Maity, S ;
Van Loo, P ;
Coessens, B ;
De Smet, F ;
Tranchevent, LC ;
De Moor, B ;
Marynen, P ;
Hassan, B ;
Carmeliet, P ;
Moreau, Y .
NATURE BIOTECHNOLOGY, 2006, 24 (05) :537-544
[3]   TOUCAN 2: the all-inclusive open source workbench for regulatory sequence analysis [J].
Aerts, S ;
Van Loo, P ;
Thijs, G ;
Mayer, H ;
de Martin, R ;
Moreau, Y ;
De Moor, B .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W393-W396
[4]   PathwayVoyager: pathway mapping using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database [J].
Altermann, E ;
Klaenhammer, TR .
BMC GENOMICS, 2005, 6 (1)
[5]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[6]   Network medicine -: From obesity to the "Diseasome'' [J].
Barabasi, Albert-Laszlo .
NEW ENGLAND JOURNAL OF MEDICINE, 2007, 357 (04) :404-407
[7]   Metabolic syndrome: From global epidemiology to individualized medicine [J].
Batsis, J. A. ;
Nieto-Martinez, R. E. ;
Lopez-Jimenez, F. .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2007, 82 (05) :509-524
[8]   DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS [J].
BOGUSKI, MS ;
LOWE, TMJ ;
TOLSTOSHEV, CM .
NATURE GENETICS, 1993, 4 (04) :332-333
[9]   Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease [J].
Botstein, D ;
Risch, N .
NATURE GENETICS, 2003, 33 (Suppl 3) :228-237
[10]   Five weeks of insulin-like growth factor-I treatment does not alter glucose kinetics or insulin sensitivity during a hyperglycemic clamp in older women [J].
Braun, B ;
Friedlander, AL ;
Pollack, M ;
Butterfield, GE ;
Marcus, R ;
Hoffman, AR .
METABOLISM-CLINICAL AND EXPERIMENTAL, 2003, 52 (09) :1182-1190