Functional Network Construction in Arabidopsis Using Rule-Based Machine Learning on Large-Scale Data Sets

被引:77
作者
Bassel, George W. [1 ,2 ]
Glaab, Enrico [3 ]
Marquez, Julietta [1 ]
Holdsworth, Michael J. [1 ,2 ]
Bacardit, Jaume [4 ,5 ]
机构
[1] Univ Nottingham, Div Plant & Crop Sci, Loughborough LE12 5RD, Leics, England
[2] Univ Nottingham, Ctr Plant Integrat Biol, Loughborough LE12 5RD, Leics, England
[3] Univ Nottingham, Sch Comp Sci, Nottingham NG8 1BB, Notts, England
[4] Sch Comp Sci, ASAP Res Grp, Nottingham NG8 1BB, England
[5] Univ Nottingham, Sch Biosci, Multidisciplinary Ctr Integrat Biol, Loughborough LE12 5RD, England
基金
英国生物技术与生命科学研究理事会; 英国工程与自然科学研究理事会;
关键词
SEED-GERMINATION; GENE-EXPRESSION; GIBBERELLIN BIOSYNTHESIS; MICROARRAY ANALYSIS; HYPOTHESIS GENERATION; LOW-TEMPERATURE; REVEALS; TRANSCRIPTION; PREDICTION; DORMANCY;
D O I
10.1105/tpc.111.088153
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The meta-analysis of large-scale postgenomics data sets within public databases promises to provide important novel biological knowledge. Statistical approaches including correlation analyses in coexpression studies of gene expression have emerged as tools to elucidate gene function using these data sets. Here, we present a powerful and novel alternative methodology to computationally identify functional relationships between genes from microarray data sets using rule-based machine learning. This approach, termed "coprediction," is based on the collective ability of groups of genes co-occurring within rules to accurately predict the developmental outcome of a biological system. We demonstrate the utility of coprediction as a powerful analytical tool using publicly available microarray data generated exclusively from Arabidopsis thaliana seeds to compute a functional gene interaction network, termed Seed Co-Prediction Network (SCoPNet). SCoPNet predicts functional associations between genes acting in the same developmental and signal transduction pathways irrespective of the similarity in their respective gene expression patterns. Using SCoPNet, we identified four novel regulators of seed germination (ALTERED SEED GERMINATION5, 6, 7, and 8), and predicted interactions at the level of transcript abundance between these novel and previously described factors influencing Arabidopsis seed germination. An online Web tool to query SCoPNet has been developed as a community resource to dissect seed biology and is available at http://www.vseed.nottingham.ac.uk/.
引用
收藏
页码:3101 / 3116
页数:16
相关论文
共 60 条
[1]   Genome-wide Insertional mutagenesis of Arabidopsis thaliana [J].
Alonso, JM ;
Stepanova, AN ;
Leisse, TJ ;
Kim, CJ ;
Chen, HM ;
Shinn, P ;
Stevenson, DK ;
Zimmerman, J ;
Barajas, P ;
Cheuk, R ;
Gadrinab, C ;
Heller, C ;
Jeske, A ;
Koesema, E ;
Meyers, CC ;
Parker, H ;
Prednis, L ;
Ansari, Y ;
Choy, N ;
Deen, H ;
Geralt, M ;
Hazari, N ;
Hom, E ;
Karnes, M ;
Mulholland, C ;
Ndubaku, R ;
Schmidt, I ;
Guzman, P ;
Aguilar-Henonin, L ;
Schmid, M ;
Weigel, D ;
Carter, DE ;
Marchand, T ;
Risseeuw, E ;
Brogden, D ;
Zeko, A ;
Crosby, WL ;
Berry, CC ;
Ecker, JR .
SCIENCE, 2003, 301 (5633) :653-657
[2]  
[Anonymous], 1993, C4 5 PROGRAMS MACHIN
[3]  
[Anonymous], INT S GRAPH DRAW
[4]  
[Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
[5]  
Bacardit J, 2004, LECT NOTES COMPUT SC, V3242, P1021
[6]  
Bacardit J., 2009, MEMET COMPUT, V1, P55
[7]   Automated Alphabet Reduction for Protein Datasets [J].
Bacardit, Jaume ;
Stout, Michael ;
Hirst, Jonathan D. ;
Valencia, Alfonso ;
Smith, Robert E. ;
Krasnogor, Natalio .
BMC BIOINFORMATICS, 2009, 10
[8]   An automated method for finding molecular complexes in large protein interaction networks [J].
Bader, GD ;
Hogue, CW .
BMC BIOINFORMATICS, 2003, 4 (1)
[9]   Elucidating the germination transcriptional program using small molecules [J].
Bassel, George W. ;
Fung, Pauline ;
Chow, Tsz-Fung Freeman ;
Foong, Justin A. ;
Provart, Nicholas J. ;
Cutler, Sean R. .
PLANT PHYSIOLOGY, 2008, 147 (01) :143-155
[10]   Genome-wide network model capturing seed germination reveals coordinated regulation of plant cellular phase transitions [J].
Bassel, George W. ;
Lan, Hui ;
Glaab, Enrico ;
Gibbs, Daniel J. ;
Gerjets, Tanja ;
Krasnogor, Natalio ;
Bonner, Anthony J. ;
Holdsworth, Michael J. ;
Provart, Nicholas J. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (23) :9709-9714