A novel network and sparsity constraint regression model for functional module identification in genomic data analysis

Cited by: 4
Authors
Xia, Zheng [1, 2]
Chen, Wei [3]
Chang, Chunqi [3]
Zhou, Xiaobo [1, 2]
Affiliations
[1] Methodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA
[2] Weill Cornell Med Coll, New York, NY 10065 USA
[3] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R China
Keywords
Laplacian matrix; elastic net; pathway; sparsity; whole solution path; PENALIZED REGRESSION; ALZHEIMERS-DISEASE; VARIABLE SELECTION; PROTEIN; LASSO; REGULARIZATION; GENERATION;
DOI
10.1504/IJDMB.2013.056081
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
It is important to incorporate accumulated knowledge of biological pathways and interactions into genome-wide association studies to elucidate correlations between genetic variants and disease. Although a number of methods have recently been developed to identify disease-related genes using prior biological knowledge, most only encourage smoothness of the coefficients along the network, which does not address the case where two connected genes both have positive or negative effects on the response. To overcome this issue, we propose applying the Laplacian operation to the absolute values of the coefficients to account for both positive and negative effects, together with an L1-norm term to impose sparsity. Further, an efficient algorithm is developed to obtain the whole solution path. Simulation studies show that the proposed method performs better than network-constrained regularisation without absolute values. Applying our method to a microarray dataset of Alzheimer's disease (AD) identifies several subnetworks on Kyoto Encyclopedia of Genes and Genomes (KEGG) transcriptional pathways that are related to the progression of AD. Many of these findings are confirmed by published literature.
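The penalty described in the abstract can be illustrated with a small numerical sketch. The abstract does not give the exact formula, so the form used here, lam1 * ||beta||_1 + lam2 * |beta|^T L |beta|, is only one plausible reading of it; the unnormalized Laplacian L = D - A, the function names `graph_laplacian` and `network_penalty`, and the two-gene toy network are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def graph_laplacian(adj):
    """Unnormalized Laplacian L = D - A of an undirected graph (assumed form)."""
    adj = np.asarray(adj, dtype=float)
    return np.diag(adj.sum(axis=1)) - adj


def network_penalty(beta, lap, lam1, lam2, use_abs=True):
    """Hypothetical penalty: lam1 * ||beta||_1 + lam2 * b^T L b.

    With use_abs=True, b = |beta| (Laplacian applied to absolute values);
    with use_abs=False, b = beta (plain network smoothness).
    """
    beta = np.asarray(beta, dtype=float)
    b = np.abs(beta) if use_abs else beta
    return lam1 * np.abs(beta).sum() + lam2 * float(b @ lap @ b)


# Toy network: two connected genes with equal-magnitude, opposite-sign effects.
adj = np.array([[0, 1],
                [1, 0]])
lap = graph_laplacian(adj)
beta = np.array([1.0, -1.0])

# Set lam1 = 0 to isolate the network term.
smooth_only = network_penalty(beta, lap, lam1=0.0, lam2=1.0, use_abs=False)
with_abs = network_penalty(beta, lap, lam1=0.0, lam2=1.0, use_abs=True)
```

In this toy case the plain smoothness term penalizes the pair because the signed coefficients differ, while the absolute-value version does not, since the two magnitudes are equal. This only shows how the two penalties differ on signed coefficients; it says nothing about the solution-path algorithm mentioned in the abstract.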
Pages: 311 - 325
Page count: 15