From gene expression to gene regulatory networks in Arabidopsis thaliana

被引:27
作者
Needham, Chris J. [1 ]
Manfield, Iain W. [2 ]
Bulpitt, Andrew J. [1 ]
Gilmartin, Philip M. [2 ]
Westhead, David R. [3 ]
机构
[1] Univ Leeds, Sch Comp, Leeds LS2 9JT, W Yorkshire, England
[2] Univ Leeds, Inst Integrat & Comparat Biol, Leeds LS2 9JT, W Yorkshire, England
[3] Univ Leeds, Inst Mol & Cellular Biol, Leeds LS2 9JT, W Yorkshire, England
基金
英国生物技术与生命科学研究理事会;
关键词
GATA TRANSCRIPTION FACTOR; CIRCADIAN CLOCK; MODELS;
D O I
10.1186/1752-0509-3-85
中图分类号
Q [生物科学];
学科分类号
090105 [作物生产系统与生态工程];
摘要
Background: The elucidation of networks from a compendium of gene expression data is one of the goals of systems biology and can be a valuable source of new hypotheses for experimental researchers. For Arabidopsis, there exist several thousand microarrays which form a valuable resource from which to learn. Results: A novel Bayesian network-based algorithm to infer gene regulatory networks from gene expression data is introduced and applied to learn parts of the transcriptomic network in Arabidopsis thaliana from a large number (thousands) of separate microarray experiments. Starting from an initial set of genes of interest, a network is grown by iterative addition to the model of the gene, from another defined set of genes, which gives the 'best' learned network structure. The gene set for iterative growth can be as large as the entire genome. A number of networks are inferred and analysed; these show (i) an agreement with the current literature on the circadian clock network, (ii) the ability to model other networks, and (iii) that the learned network hypotheses can suggest new roles for poorly characterized genes, through addition of relevant genes from an unconstrained list of over 15,000 possible genes. To demonstrate the latter point, the method is used to suggest that particular GATA transcription factors are regulators of photosynthetic genes. Additionally, the performance in recovering a known network from different amounts of synthetically generated data is evaluated. Conclusion: Our results show that plausible regulatory networks can be learned from such gene expression data alone. This work demonstrates that network hypotheses can be generated from existing gene expression data for use by experimental biologists.
引用
收藏
页数:18
相关论文
共 44 条
[1]
Gene regulatory network models for plant development [J].
Alvarez-Buylla, Elena R. ;
Benitez, Mariana ;
Davila, Enrique Balleza ;
Chaos, Alvaro ;
Espinosa-Soto, Carlos ;
Padilla-Longoria, Pablo .
CURRENT OPINION IN PLANT BIOLOGY, 2007, 10 (01) :83-91
[2]
[Anonymous], APPL STAT
[3]
How to infer gene networks from expression profiles [J].
Bansal, Mukesh ;
Belcastro, Vincenzo ;
Ambesi-Impiombato, Alberto ;
di Bernardo, Diego .
MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
[4]
Reverse engineering of regulatory networks in human B cells [J].
Basso, K ;
Margolin, AA ;
Stolovitzky, G ;
Klein, U ;
Dalla-Favera, R ;
Califano, A .
NATURE GENETICS, 2005, 37 (04) :382-390
[5]
Genetic analysis of Arabidopsis GATA transcription factor gene family reveals a nitrate-inducible member important for chlorophyll synthesis and glucose sensitivity [J].
Bi, YM ;
Zhang, Y ;
Signorelli, T ;
Zhao, R ;
Zhu, T ;
Rothstein, S .
PLANT JOURNAL, 2005, 44 (04) :680-692
[6]
NASCArrays: a repository for microarray data generated by NASC's transcriptomics service [J].
Craigon, DJ ;
James, N ;
Okyere, J ;
Higgins, J ;
Jotham, J ;
May, S .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D575-D577
[7]
FLOWERING LOCUS C mediates natural variation in the high-temperature response of the Arabidopsis circadian clock [J].
Edwards, KD ;
Anderson, PE ;
Hall, A ;
Salathia, NS ;
Locke, JCW ;
Lynn, JR ;
Straume, M ;
Smith, JQ ;
Millar, AJ .
PLANT CELL, 2006, 18 (03) :639-650
[8]
Low temperature induction of Arabidopsis CBF1, 2, and 3 is gated by the circadian clock [J].
Fowler, SG ;
Cook, D ;
Thomashow, ME .
PLANT PHYSIOLOGY, 2005, 137 (03) :961-968
[9]
Construction, visualisation, and clustering of transcription networks from Microarray expression data [J].
Freeman, Tom C. ;
Goldovsky, Leon ;
Brosch, Markus ;
Van Dongen, Stijn ;
Maziere, Pierre ;
Grocock, Russell J. ;
Freilich, Shiri ;
Thornton, Janet ;
Enright, Anton J. .
PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (10) :2032-2042
[10]
Inferring cellular networks using probabilistic graphical models [J].
Friedman, N .
SCIENCE, 2004, 303 (5659) :799-805