Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks

被引:330
作者
Husmeier, D [1 ]
机构
[1] JCMB, Biomath & Stat Scotland, Edinburgh EH9 3JZ, Midlothian, Scotland
关键词
D O I
10.1093/bioinformatics/btg313
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Bayesian networks have been applied to infer genetic regulatory interactions from microarray gene expression data. This inference problem is particularly hard in that interactions between hundreds of genes have to be learned from very small data sets, typically containing only a few dozen time points during a cell cycle. Most previous studies have assessed the inference results on real gene expression data by comparing predicted genetic regulatory interactions with those known from the biological literature. This approach is controversial due to the absence of known gold standards, which renders the estimation of the sensitivity and specificity, that is, the true and (complementary) false detection rate, unreliable and difficult. The objective of the present study is to test the viability of the Bayesian network paradigm in a realistic simulation study. First, gene expression data are simulated from a realistic biological network involving DNAs, mRNAs, inactive protein monomers and active protein dimers. Then, interaction networks are inferred from these data in a reverse engineering approach, using Bayesian networks and Bayesian learning with Markov chain Monte Carlo. Results: The simulation results are presented as receiver operator characteristics curves. This allows estimating the proportion of spurious gene interactions incurred for a specified target proportion of recovered true interactions. The findings demonstrate how the network inference performance varies with the training set size, the degree of inadequacy of prior assumptions, the experimental sampling strategy and the inclusion of further, sequence-based information.
引用
收藏
页码:2271 / 2282
页数:12
相关论文
共 35 条
  • [21] Kauffman S., 1993, The Origins of Order
  • [22] METABOLIC STABILITY AND EPIGENESIS IN RANDOMLY CONSTRUCTED GENETIC NETS
    KAUFFMAN, SA
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 1969, 22 (03) : 437 - &
  • [23] Learning probabilistic networks
    Krause, PJ
    [J]. KNOWLEDGE ENGINEERING REVIEW, 1998, 13 (04) : 321 - 351
  • [24] EQUATION OF STATE CALCULATIONS BY FAST COMPUTING MACHINES
    METROPOLIS, N
    ROSENBLUTH, AW
    ROSENBLUTH, MN
    TELLER, AH
    TELLER, E
    [J]. JOURNAL OF CHEMICAL PHYSICS, 1953, 21 (06) : 1087 - 1092
  • [25] Murphy K., 1999, Modelling gene expression data using dynamic Bayesian networks
  • [26] Ong Irene M, 2002, Bioinformatics, V18 Suppl 1, pS241
  • [27] Pe'er D, 2001, Bioinformatics, V17 Suppl 1, pS215
  • [28] Pearl P, 1988, PROBABILISTIC REASON, DOI DOI 10.1016/C2009-0-27609-4
  • [29] RANGEL C, 2001, P 2 INT C SYST BIOL, P248
  • [30] Genome-wide location and function of DNA binding proteins
    Ren, B
    Robert, F
    Wyrick, JJ
    Aparicio, O
    Jennings, EG
    Simon, I
    Zeitlinger, J
    Schreiber, J
    Hannett, N
    Kanin, E
    Volkert, TL
    Wilson, CJ
    Bell, SP
    Young, RA
    [J]. SCIENCE, 2000, 290 (5500) : 2306 - +