Applying dynamic Bayesian networks to perturbed gene expression data

被引:106
作者
Dojer, Norbert
Gambin, Anna
Mizera, Andrzej
Wilczynski, Bartek
Tiuryn, Jerzy
机构
[1] Warsaw Univ, Inst Informat, PL-02097 Warsaw, Poland
[2] Polish Acad Sci, Inst Fundamental Technol Res, PL-00049 Warsaw, Poland
[3] Polish Acad Sci, Inst Math, PL-00956 Warsaw, Poland
关键词
D O I
10.1186/1471-2105-7-249
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A central goal of molecular biology is to understand the regulatory mechanisms of gene transcription and protein synthesis. Because of their solid basis in statistics, allowing to deal with the stochastic aspects of gene expressions and noisy measurements in a natural way, Bayesian networks appear attractive in the field of inferring gene interactions structure from microarray experiments data. However, the basic formalism has some disadvantages, e. g. it is sometimes hard to distinguish between the origin and the target of an interaction. Two kinds of microarray experiments yield data particularly rich in information regarding the direction of interactions: time series and perturbation experiments. In order to correctly handle them, the basic formalism must be modified. For example, dynamic Bayesian networks (DBN) apply to time series microarray data. To our knowledge the DBN technique has not been applied in the context of perturbation experiments. Results: We extend the framework of dynamic Bayesian networks in order to incorporate perturbations. Moreover, an exact algorithm for inferring an optimal network is proposed and a discretization method specialized for time series data from perturbation experiments is introduced. We apply our procedure to realistic simulations data. The results are compared with those obtained by standard DBN learning techniques. Moreover, the advantages of using exact learning algorithm instead of heuristic methods are analyzed. Conclusion: We show that the quality of inferred networks dramatically improves when using data from perturbation experiments. We also conclude that the exact algorithm should be used when it is possible, i.e. when considered set of genes is small enough.
引用
收藏
页数:11
相关论文
共 25 条
[1]  
Akutsu, 1998, Genome Inform Ser Workshop Genome Inform, V9, P151
[2]   Biological rhythms - Circadian clocks limited by noise [J].
Barkai, N ;
Leibler, S .
NATURE, 2000, 403 (6767) :267-268
[3]   Reverse engineering of regulatory networks in human B cells [J].
Basso, K ;
Margolin, AA ;
Stolovitzky, G ;
Klein, U ;
Dalla-Favera, R ;
Califano, A .
NATURE GENETICS, 2005, 37 (04) :382-390
[4]   How to make a Biological Switch [J].
Cherry, JL ;
Adler, FR .
JOURNAL OF THEORETICAL BIOLOGY, 2000, 203 (02) :117-133
[5]  
Chickering DM, 2004, J MACH LEARN RES, V5, P1287
[6]  
COOPER GF, 1992, MACH LEARN, V9, P309, DOI 10.1007/BF00994110
[7]   Modeling and simulation of genetic regulatory systems: A literature review [J].
De Jong, H .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (01) :67-103
[8]   Chemogenomic profiling on a genomewide scale using reverse-engineered gene networks [J].
di Bernardo, D ;
Thompson, MJ ;
Gardner, TS ;
Chobot, SE ;
Eastwood, EL ;
Wojtovich, AP ;
Elliott, SJ ;
Schaus, SE ;
Collins, JJ .
NATURE BIOTECHNOLOGY, 2005, 23 (03) :377-383
[9]   Inferring cellular networks using probabilistic graphical models [J].
Friedman, N .
SCIENCE, 2004, 303 (5659) :799-805
[10]   Using Bayesian networks to analyze expression data [J].
Friedman, N ;
Linial, M ;
Nachman, I ;
Pe'er, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :601-620