Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks

被引:330
作者
Husmeier, D [1 ]
机构
[1] JCMB, Biomath & Stat Scotland, Edinburgh EH9 3JZ, Midlothian, Scotland
关键词
D O I
10.1093/bioinformatics/btg313
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Bayesian networks have been applied to infer genetic regulatory interactions from microarray gene expression data. This inference problem is particularly hard in that interactions between hundreds of genes have to be learned from very small data sets, typically containing only a few dozen time points during a cell cycle. Most previous studies have assessed the inference results on real gene expression data by comparing predicted genetic regulatory interactions with those known from the biological literature. This approach is controversial due to the absence of known gold standards, which renders the estimation of the sensitivity and specificity, that is, the true and (complementary) false detection rate, unreliable and difficult. The objective of the present study is to test the viability of the Bayesian network paradigm in a realistic simulation study. First, gene expression data are simulated from a realistic biological network involving DNAs, mRNAs, inactive protein monomers and active protein dimers. Then, interaction networks are inferred from these data in a reverse engineering approach, using Bayesian networks and Bayesian learning with Markov chain Monte Carlo. Results: The simulation results are presented as receiver operator characteristics curves. This allows estimating the proportion of spurious gene interactions incurred for a specified target proportion of recovered true interactions. The findings demonstrate how the network inference performance varies with the training set size, the degree of inadequacy of prior assumptions, the experimental sampling strategy and the inclusion of further, sequence-based information.
引用
收藏
页码:2271 / 2282
页数:12
相关论文
共 35 条
  • [1] ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
  • [2] [Anonymous], BAYES NET TOOLBOX
  • [3] [Anonymous], LEARNING DATA ARTIFI
  • [4] ATKINS PW, 1986, PHYSICAL CHEM
  • [5] Biological rhythms - Circadian clocks limited by noise
    Barkai, N
    Leibler, S
    [J]. NATURE, 2000, 403 (6767) : 267 - 268
  • [6] CHEN T, 1999, P PAC S BIOC, V4, P29
  • [7] UNDERSTANDING THE METROPOLIS-HASTINGS ALGORITHM
    CHIB, S
    GREENBERG, E
    [J]. AMERICAN STATISTICIAN, 1995, 49 (04) : 327 - 335
  • [8] CHICKERING DM, 1995, INT C UNC ART INT UA, V11, P87
  • [9] Genetic network inference: from co-expression clustering to reverse engineering
    D'haeseleer, P
    Liang, SD
    Somogyi, R
    [J]. BIOINFORMATICS, 2000, 16 (08) : 707 - 726
  • [10] Modeling and simulation of genetic regulatory systems: A literature review
    De Jong, H
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (01) : 67 - 103