WGCNA: an R package for weighted correlation network analysis

被引:16552
作者
Langfelder, Peter [1 ]
Horvath, Steve [1 ,2 ]
机构
[1] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
关键词
D O I
10.1186/1471-2105-9-559
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters ( modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits ( using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e. g. cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial. Results: The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings. Conclusion: The WGCNA package provides R functions for weighted correlation network analysis, e. g. co-expression network analysis of gene expression data. The R package along with its source code and additional material are freely available at http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/Rpackages/WGCNA.
引用
收藏
页数:13
相关论文
共 48 条
  • [31] Inferring causal phenotype networks from segregating populations
    Neto, Elias Chaibub
    Ferrara, Christine T.
    Attie, Alan D.
    Yandell, Brian S.
    [J]. GENETICS, 2008, 179 (02) : 1089 - 1100
  • [32] Functional organization of the transcriptome in human brain
    Oldham, Michael C.
    Konopka, Genevieve
    Iwamoto, Kazuya
    Langfelder, Peter
    Kato, Tadafumi
    Horvath, Steve
    Geschwind, Daniel H.
    [J]. NATURE NEUROSCIENCE, 2008, 11 (11) : 1271 - 1282
  • [33] Conservation and evolution of gene colexpression networks in human and chimpanzee brains
    Oldham, Michael C.
    Horvath, Steve
    Geschwind, Daniel H.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (47) : 17973 - 17978
  • [34] From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data
    Opgen-Rhein, Rainer
    Strimmer, Korbinian
    [J]. BMC SYSTEMS BIOLOGY, 2007, 1
  • [35] Integrated Weighted Gene Co-expression Network Analysis with an Application to Chronic Fatigue Syndrome
    Presson, Angela P.
    Sobel, Eric M.
    Papp, Jeanette C.
    Suarez, Charlyn J.
    Whistler, Toni
    Rajeevan, Mangalathu S.
    Vernon, Suzanne D.
    Horvath, Steve
    [J]. BMC SYSTEMS BIOLOGY, 2009, 2
  • [36] Hierarchical organization of modularity in metabolic networks
    Ravasz, E
    Somera, AL
    Mongru, DA
    Oltvai, ZN
    Barabási, AL
    [J]. SCIENCE, 2002, 297 (5586) : 1551 - 1555
  • [37] An empirical Bayes approach to inferring large-scale gene association networks
    Schäfer, J
    Strimmer, K
    [J]. BIOINFORMATICS, 2005, 21 (06) : 754 - 764
  • [38] Cytoscape: A software environment for integrated models of biomolecular interaction networks
    Shannon, P
    Markiel, A
    Ozier, O
    Baliga, NS
    Wang, JT
    Ramage, D
    Amin, N
    Schwikowski, B
    Ideker, T
    [J]. GENOME RESEARCH, 2003, 13 (11) : 2498 - 2504
  • [39] Automated modelling of signal transduction networks
    Steffen, M
    Petti, A
    Aach, J
    D'haeseleer, P
    Church, G
    [J]. BMC BIOINFORMATICS, 2002, 3 (1)
  • [40] A gene-coexpression network for global discovery of conserved genetic modules
    Stuart, JM
    Segal, E
    Koller, D
    Kim, SK
    [J]. SCIENCE, 2003, 302 (5643) : 249 - 255