Causal reasoning on biological networks: interpreting transcriptional changes

被引:105
作者
Chindelevitch, Leonid [1 ]
Ziemek, Daniel [1 ]
Enayetallah, Ahmed [2 ]
Randhawa, Ranjit [1 ]
Sidders, Ben [3 ]
Brockel, Christoph [4 ]
Huang, Enoch S. [1 ]
机构
[1] Pfizer Worldwide Res & Dev, Computat Sci Ctr Emphasis, Cambridge, MA 02140 USA
[2] Pfizer Worldwide Med Chem, Compound Safety Predict, Groton, CT 06340 USA
[3] Pfizer Worldwide Res & Dev, Neusentis, Cambridge CB21 6GS, England
[4] Pfizer Business Technol, Translat & Bioinformat, Cambridge, MA 02140 USA
关键词
CARDIAC FIBROSIS; GENE-EXPRESSION; CARDIOMYOPATHY; MICE;
D O I
10.1093/bioinformatics/bts090
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: The interpretation of high-throughput datasets has remained one of the central challenges of computational biology over the past decade. Furthermore, as the amount of biological knowledge increases, it becomes more and more difficult to integrate this large body of knowledge in a meaningful manner. In this article, we propose a particular solution to both of these challenges. Methods: We integrate available biological knowledge by constructing a network of molecular interactions of a specific kind: causal interactions. The resulting causal graph can be queried to suggest molecular hypotheses that explain the variations observed in a high-throughput gene expression experiment. We show that a simple scoring function can discriminate between a large number of competing molecular hypotheses about the upstream cause of the changes observed in a gene expression profile. We then develop an analytical method for computing the statistical significance of each score. This analytical method also helps assess the effects of random or adversarial noise on the predictive power of our model. Results: Our results show that the causal graph we constructed from known biological literature is extremely robust to random noise and to missing or spurious information. We demonstrate the power of our causal reasoning model on two specific examples, one from a cancer dataset and the other from a cardiac hypertrophy experiment. We conclude that causal reasoning models provide a valuable addition to the biologist's toolkit for the interpretation of gene expression data.
引用
收藏
页码:1114 / 1121
页数:8
相关论文
共 19 条
[1]
[Anonymous], 1996, SANKHYA INDIAN J S A
[2]
Oxidative stress triggers cardiac fibrosis in the heart of diabetic rats [J].
Aragno, Manuela ;
Mastrocola, Raffaella ;
Alloatti, Giuseppe ;
Vercellinatto, Ilenia ;
Bardini, Paola ;
Geuna, Stefano ;
Catalano, Maria Graziella ;
Danni, Oliviero ;
Boccuzzi, Giuseppe .
ENDOCRINOLOGY, 2008, 149 (01) :380-388
[3]
Repression of the Arf tumor suppressor by E2F3 is required for normal cell cycle kinetics [J].
Aslanian, A ;
Iaquinta, PJ ;
Verona, R ;
Lees, JA .
GENES & DEVELOPMENT, 2004, 18 (12) :1413-1422
[4]
Oncogenic pathway signatures in human cancers as a guide to targeted therapies [J].
Bild, AH ;
Yao, G ;
Chang, JT ;
Wang, QL ;
Potti, A ;
Chasse, D ;
Joshi, MB ;
Harpole, D ;
Lancaster, JM ;
Berchuck, A ;
Olson, JA ;
Marks, JR ;
Dressman, HK ;
West, M ;
Nevins, JR .
NATURE, 2006, 439 (7074) :353-357
[5]
Emerging roles of E2Fs in cancer: an exit from cell cycle control [J].
Chen, Hui-Zi ;
Tsai, Shih-Yin ;
Leone, Gustavo .
NATURE REVIEWS CANCER, 2009, 9 (11) :785-797
[6]
Global functional profiling of gene expression [J].
Draghici, S ;
Khatri, P ;
Martins, RP ;
Ostermeier, GC ;
Krawetz, SA .
GENOMICS, 2003, 81 (02) :98-104
[7]
Galindo Cristi L., 2009, BMC Physiology, V9, P23, DOI 10.1186/1472-6793-9-23
[8]
Hartemink A J, 2001, Pac Symp Biocomput, P422
[9]
KEGG for representation and analysis of molecular networks involving diseases and drugs [J].
Kanehisa, Minoru ;
Goto, Susumu ;
Furumichi, Miho ;
Tanabe, Mao ;
Hirakawa, Mika .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D355-D360
[10]
Petkovsek M., 1996, A B