Evolutionary approaches for the reverse-engineering of gene regulatory networks: A study on a biologically realistic dataset

被引:21
作者
Auliac, Cedric [1 ,2 ]
Frouin, Vincent [2 ]
Gidrol, Xavier [2 ]
d'Alche-Buc, Florence [1 ]
机构
[1] Univ Evry Val Essonne, IBISC, Evry, France
[2] CEA, LEFG, IRCM, DSV, Evry, France
关键词
D O I
10.1186/1471-2105-9-91
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Inferring gene regulatory networks from data requires the development of algorithms devoted to structure extraction. When only static data are available, gene interactions may be modelled by a Bayesian Network (BN) that represents the presence of direct interactions from regulators to regulees by conditional probability distributions. We used enhanced evolutionary algorithms to stochastically evolve a set of candidate BN structures and found the model that best fits data without prior knowledge. Results: We proposed various evolutionary strategies suitable for the task and tested our choices using simulated data drawn from a given bio-realistic network of 35 nodes, the so-called insulin network, which has been used in the literature for benchmarking. We assessed the inferred models against this reference to obtain statistical performance results. We then compared performances of evolutionary algorithms using two kinds of recombination operators that operate at different scales in the graphs. We introduced a niching strategy that reinforces diversity through the population and avoided trapping of the algorithm in one local minimum in the early steps of learning. We show the limited effect of the mutation operator when niching is applied. Finally, we compared our best evolutionary approach with various well known learning algorithms (MCMC, K2, greedy search, TPDA, MMHC) devoted to BN structure learning. Conclusion: We studied the behaviour of an evolutionary approach enhanced by niching for the learning of gene regulatory networks with BN. We show that this approach outperforms classical structure learning methods in elucidating the original model. These results were obtained for the learning of a bio-realistic network and, more importantly, on various small datasets. This is a suitable approach for learning transcriptional regulatory networks from real datasets without prior knowledge.
引用
收藏
页数:14
相关论文
共 45 条
[1]  
ALIFERIS CF, 2003, CAUSAL EXPLORER CAUS, P371
[2]  
[Anonymous], 1975, ANAL BEHAV CLASS GEN
[3]  
[Anonymous], 1991, Equivalence and Synthesis of Causal Models
[4]   A guide to the literature on learning probabilistic networks from data [J].
Buntine, W .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (02) :195-210
[5]   Learning Bayesian networks from data: An information-theory based approach [J].
Cheng, J ;
Greiner, R ;
Kelly, J ;
Bell, D ;
Liu, WR .
ARTIFICIAL INTELLIGENCE, 2002, 137 (1-2) :43-90
[6]  
Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717
[7]  
Chickering D.M., 1996, Learning Bayesian Networks is NPComplete, V112, P121, DOI DOI 10.1007/978-1-4612-2404-4_12
[8]  
CHICKERING DM, 1995, P 11 C UNC ART INT, P87
[9]  
COOPER GF, 1992, MACH LEARN, V9, P309, DOI 10.1007/BF00994110
[10]  
COTTA C, 2002, P PARALLEL PROBLEM S, V7, P730