WGCNA: an R package for weighted correlation network analysis

Cited by: 16552
Authors
Langfelder, Peter [1]
Horvath, Steve [1, 2]
Affiliations
[1] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
DOI
10.1186/1471-2105-9-559
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification code
071010; 081704
Abstract
Background: Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network-based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e.g., cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial.
Results: The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings.
Conclusion: The WGCNA package provides R functions for weighted correlation network analysis, e.g., co-expression network analysis of gene expression data. The R package, along with its source code and additional material, is freely available at http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/Rpackages/WGCNA.
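The network construction step mentioned in the abstract can be illustrated with a minimal sketch. It is written in plain Python rather than R and does not use the WGCNA package itself. It assumes the common unsigned soft-thresholding form, in which the weight between two genes is the absolute Pearson correlation raised to a power beta. The function names, the toy expression profiles, and the choice beta = 6 are illustrative assumptions, not taken from this record.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def soft_threshold_adjacency(profiles, beta=6):
    """Unsigned weighted adjacency: a_ij = |cor(x_i, x_j)| ** beta.

    The diagonal is left at 0 (a simplification) so that the
    connectivity computed below excludes self-connections.
    """
    n = len(profiles)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            weight = abs(pearson(profiles[i], profiles[j])) ** beta
            adj[i][j] = adj[j][i] = weight
    return adj


def connectivity(adj):
    """Weighted degree of each node: the sum of its edge weights."""
    return [sum(row) for row in adj]


# Hypothetical toy data: four "genes" measured across five samples.
profiles = [
    [1.0, 2.0, 3.0, 4.0, 5.0],  # gene A
    [2.0, 4.1, 5.9, 8.2, 9.9],  # gene B, closely tracks A
    [5.0, 3.9, 3.1, 2.0, 1.1],  # gene C, anti-correlated with A
    [2.0, 5.0, 1.0, 4.0, 3.0],  # gene D, only weakly related to the others
]

adj = soft_threshold_adjacency(profiles, beta=6)
k = connectivity(adj)
```

In this toy example, genes A, B, and C end up strongly connected to one another (the unsigned form treats strong negative correlation like strong positive correlation), while the weak correlations involving gene D are pushed close to zero by the power transformation. Module detection and eigengene summaries would build on such an adjacency matrix and are not shown here.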
Pages: 13