Gene Coexpression Network Analysis as a Source of Functional Annotation for Rice Genes

被引:108
作者
Childs, Kevin L. [1 ]
Davidson, Rebecca M. [1 ]
Buell, C. Robin [1 ]
机构
[1] Michigan State Univ, Dept Plant Biol, E Lansing, MI 48824 USA
来源
PLOS ONE | 2011年 / 6卷 / 07期
基金
美国国家科学基金会;
关键词
TRANSCRIPTION FACTORS; EXPRESSION ATLAS; CLUSTER-ANALYSIS; DATABASE; PROTEINS; DEFENSE; BIOLOGY; MODULES; BIOINFORMATICS; SUPERFAMILY;
D O I
10.1371/journal.pone.0022196
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the existence of large publicly available plant gene expression data sets, many groups have undertaken data analyses to construct gene coexpression networks and functionally annotate genes. Often, a large compendium of unrelated or condition-independent expression data is used to construct gene networks. Condition-dependent expression experiments consisting of well-defined conditions/treatments have also been used to create coexpression networks to help examine particular biological processes. Gene networks derived from either condition-dependent or condition-independent data can be difficult to interpret if a large number of genes and connections are present. However, algorithms exist to identify modules of highly connected and biologically relevant genes within coexpression networks. In this study, we have used publicly available rice (Oryza sativa) gene expression data to create gene coexpression networks using both condition-dependent and condition-independent data and have identified gene modules within these networks using the Weighted Gene Coexpression Network Analysis method. We compared the number of genes assigned to modules and the biological interpretability of gene coexpression modules to assess the utility of condition-dependent and condition-independent gene coexpression networks. For the purpose of providing functional annotation to rice genes, we found that gene modules identified by coexpression analysis of condition-dependent gene expression experiments to be more useful than gene modules identified by analysis of a condition-independent data set. We have incorporated our results into the MSU Rice Genome Annotation Project database as additional expression-based annotation for 13,537 genes, 2,980 of which lack a functional annotation description. These results provide two new types of functional annotation for our database. Genes in modules are now associated with groups of genes that constitute a collective functional annotation of those modules. Additionally, the expression patterns of genes across the treatments/conditions of an expression experiment comprise a second form of useful annotation.
引用
收藏
页数:12
相关论文
共 64 条
  • [1] Approaches for extracting practical information from gene co-expression networks in plant biology
    Aoki, Koh
    Ogata, Yoshiyuki
    Shibata, Daisuke
    [J]. PLANT AND CELL PHYSIOLOGY, 2007, 48 (03) : 381 - 390
  • [2] NCBI GEO: archive for high-throughput functional genomic data
    Barrett, Tanya
    Troup, Dennis B.
    Wilhite, Stephen E.
    Ledoux, Pierre
    Rudnev, Dmitry
    Evangelista, Carlos
    Kim, Irene F.
    Soboleva, Alexandra
    Tomashevsky, Maxim
    Marshall, Kimberly A.
    Phillippy, Katherine H.
    Sherman, Patti M.
    Muertter, Rolf N.
    Edgar, Ron
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D885 - D890
  • [3] A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
    Bolstad, BM
    Irizarry, RA
    Åstrand, M
    Speed, TP
    [J]. BIOINFORMATICS, 2003, 19 (02) : 185 - 193
  • [4] Arabidopsis RETINOBLASTOMA-RELATED Is Required for Stem Cell Maintenance, Cell Differentiation, and Lateral Organ Production
    Borghi, Lorenzo
    Gutzat, Ruben
    Fuetterer, Johannes
    Laizet, Yec'han
    Hennig, Lars
    Gruissem, Wilhelm
    [J]. PLANT CELL, 2010, 22 (06) : 1792 - 1811
  • [5] Crystal structure of glycogen synthase: homologous enzymes catalyze glycogen synthesis and degradation
    Buschiazzo, A
    Ugalde, JE
    Guerin, ME
    Shepard, W
    Ugalde, RA
    Alzari, PM
    [J]. EMBO JOURNAL, 2004, 23 (16) : 3196 - 3205
  • [6] Identifying Modules of Coexpressed Transcript Units and Their Organization of Saccharopolyspora erythraea from Time Series Gene Expression Profiles
    Chang, Xiao
    Liu, Shuai
    Yu, Yong-Tao
    Li, Yi-Xue
    Li, Yuan-Yuan
    [J]. PLOS ONE, 2010, 5 (08):
  • [7] On the Choice and Number of Microarrays for Transcriptional Regulatory Network Inference
    Cosgrove, Elissa J.
    Gardner, Timothy S.
    Kolaczyk, Eric D.
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [8] Dunwell JM, 1998, BIOTECHNOL GENET ENG, V15, P1
  • [9] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
  • [10] Networks of WRKY transcription factors in defense signaling
    Eulgem, Thomas
    Somssich, Imre E.
    [J]. CURRENT OPINION IN PLANT BIOLOGY, 2007, 10 (04) : 366 - 371