Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical gaussian models and bayesian networks

被引:212
作者
Werhli, Adriano V. [1 ]
Grzegorczyk, Marco
Husmeier, Dirk
机构
[1] Biomath & Stat Scotland, Edinburgh, Midlothian, Scotland
[2] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
[3] Univ Dortmund, Dept Stat, D-44221 Dortmund, Germany
关键词
D O I
10.1093/bioinformatics/btl391
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: An important problem in systems biology is the inference of biochemical pathways and regulatory networks from postgenomic data. Various reverse engineering methods have been proposed in the literature, and it is important to understand their relative merits and shortcomings. In the present paper, we compare the accuracy of reconstructing gene regulatory networks with three different modelling and inference paradigms: (1) Relevance networks (RNs): pairwise association scores independent of the remaining network; (2) graphical Gaussian models (GGMs): undirected graphical models with constraint-based inference, and (3) Bayesian networks (BNs): directed graphical models with score-based inference. The evaluation is carried out on the Raf pathway, a cellular signalling network describing the interaction of 11 phosphorylated proteins and phospholipids in human immune system cells. We use both laboratory data from cytometry experiments as well as data simulated from the gold-standard network. We also compare passive observations with active interventions. Results: On Gaussian observational data, BNs and GGMs were found to outperform RNs. The difference in performance was not significant for the non-linear simulated data and the cytoflow data, though. Also, we did not observe a significant difference between BNs and GGMs on observational data in general. However, for interventional data, BNs outperform GGMs and RNs, especially when taking the edge directions rather than just the skeletons of the graphs into account. This suggests that the higher computational costs of inference with BNs over GGMs and RNs are not justified when using only passive observations, but that active interventions in the form of gene knockouts and over-expressions are required to exploit the full potential of BNs.
引用
收藏
页码:2523 / 2531
页数:9
相关论文
共 28 条
  • [1] [Anonymous], 2000, Introduction to Graphical Modelling
  • [2] Atkins P.W, 1986, PHYS CHEM, V3
  • [3] Butte A J, 2000, Pac Symp Biocomput, P418
  • [4] Butte AJ, 2003, ANAL GENE EXPRESSION, P428, DOI DOI 10.1007/0-387-21679-0_19
  • [5] CHICKERING DM, 1995, INT C UNC ART INT UA, V11, P87
  • [6] Regulation of raf-1 by direct feedback phosphorylation
    Dougherty, MK
    Müller, J
    Ritt, DA
    Zhou, M
    Zhou, XZ
    Copeland, TD
    Conrads, TP
    Veenstra, TD
    Lu, KP
    Morrison, DK
    [J]. MOLECULAR CELL, 2005, 17 (02) : 215 - 224
  • [7] Being Bayesian about network structure. A Bayesian approach to structure discovery in Bayesian networks
    Friedman, N
    Koller, D
    [J]. MACHINE LEARNING, 2003, 50 (1-2) : 95 - 125
  • [8] Using Bayesian networks to analyze expression data
    Friedman, N
    Linial, M
    Nachman, I
    Pe'er, D
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) : 601 - 620
  • [9] GEIGER D, 1994, P 10 C UNC ART INT, P235
  • [10] LEARNING BAYESIAN NETWORKS - THE COMBINATION OF KNOWLEDGE AND STATISTICAL-DATA
    HECKERMAN, D
    GEIGER, D
    CHICKERING, DM
    [J]. MACHINE LEARNING, 1995, 20 (03) : 197 - 243