Integrated Weighted Gene Co-expression Network Analysis with an Application to Chronic Fatigue Syndrome

被引:138
作者
Presson, Angela P. [1 ]
Sobel, Eric M. [1 ]
Papp, Jeanette C. [1 ]
Suarez, Charlyn J. [1 ]
Whistler, Toni [2 ]
Rajeevan, Mangalathu S. [2 ]
Vernon, Suzanne D. [2 ,3 ]
Horvath, Steve [1 ]
机构
[1] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
[2] Ctr Dis Control & Prevent, Div Viral & Rickettsial Dis, Natl Ctr Zoonot Vector Borne & Enter Dis, Atlanta, GA USA
[3] CFIDS, Charlotte, NC USA
关键词
MENDELIAN RANDOMIZATION; HPA AXIS; EXPRESSION; POLYMORPHISMS; ENCEPHALOMYOPATHY; IMMUNODEFICIENCY; IDENTIFICATION; PREVALENCE; ALOPECIA; MUTATION;
D O I
10.1186/1752-0509-2-95
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Systems biologic approaches such as Weighted Gene Co-expression Network Analysis (WGCNA) can effectively integrate gene expression and trait data to identify pathways and candidate biomarkers. Here we show that the additional inclusion of genetic marker data allows one to characterize network relationships as causal or reactive in a chronic fatigue syndrome (CFS) data set. Results: We combine WGCNA with genetic marker data to identify a disease-related pathway and its causal drivers, an analysis which we refer to as "Integrated WGCNA" or IWGCNA. Specifically, we present the following IWGCNA approach: 1) construct a co-expression network, 2) identify trait-related modules within the network, 3) use a trait-related genetic marker to prioritize genes within the module, 4) apply an integrated gene screening strategy to identify candidate genes and 5) carry out causality testing to verify and/or prioritize results. By applying this strategy to a CFS data set consisting of microarray, SNP and clinical trait data, we identify a module of 299 highly correlated genes that is associated with CFS severity. Our integrated gene screening strategy results in 20 candidate genes. We show that our approach yields biologically interesting genes that function in the same pathway and are causal drivers for their parent module. We use a separate data set to replicate findings and use Ingenuity Pathways Analysis software to functionally annotate the candidate gene pathways. Conclusion: We show how WGCNA can be combined with genetic marker data to identify disease-related pathways and the causal drivers within them. The systems genetics approach described here can easily be used to generate testable genetic hypotheses in other complex disease studies.
引用
收藏
页数:21
相关论文
共 72 条
  • [31] Hypothalamic digoxin, cerebral chemical dominance and myalgic encephalomyelitis
    Kurup, RK
    Kurup, PA
    [J]. INTERNATIONAL JOURNAL OF NEUROSCIENCE, 2003, 113 (05) : 683 - 701
  • [32] Enterovirus related metabolic myopathy: a postviral fatigue syndrome
    Lane, RJM
    Soteriou, BA
    Zhang, H
    Archard, LC
    [J]. JOURNAL OF NEUROLOGY NEUROSURGERY AND PSYCHIATRY, 2003, 74 (10) : 1382 - 1386
  • [33] Network neighborhood analysis with the multi-node topological overlap measure
    Li, Ai
    Horvath, Steve
    [J]. BIOINFORMATICS, 2007, 23 (02) : 222 - 231
  • [34] Structural model analysis of multiple quantitative traits
    Li, Renhua
    Tsaih, Shirng-Wern
    Shockley, Keith
    Stylianou, Ioannis M.
    Wergedal, Jon
    Paigen, Beverly
    Churchill, Gary A.
    [J]. PLOS GENETICS, 2006, 2 (07): : 1046 - 1057
  • [35] Mendelian randomisation: a new spin or real progress?
    Little, J
    Khoury, MJ
    [J]. LANCET, 2003, 362 (9388) : 930 - 931
  • [36] PREVALENCE OF CHRONIC FATIGUE SYNDROME IN AN AUSTRALIAN POPULATION
    LLOYD, AR
    HICKIE, I
    BOUGHTON, CR
    SPENCER, O
    WAKEFIELD, D
    [J]. MEDICAL JOURNAL OF AUSTRALIA, 1990, 153 (09) : 522 - 528
  • [37] LLOYD AR, 1991, BRAIN, V114, P85
  • [38] Inferring causal phenotype networks from segregating populations
    Neto, Elias Chaibub
    Ferrara, Christine T.
    Attie, Alan D.
    Yandell, Brian S.
    [J]. GENETICS, 2008, 179 (02) : 1089 - 1100
  • [39] Conservation and evolution of gene colexpression networks in human and chimpanzee brains
    Oldham, Michael C.
    Horvath, Steve
    Geschwind, Daniel H.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (47) : 17973 - 17978
  • [40] From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data
    Opgen-Rhein, Rainer
    Strimmer, Korbinian
    [J]. BMC SYSTEMS BIOLOGY, 2007, 1