A novel network and sparsity constraint regression model for functional module identification in genomic data analysis

Cited by: 4

Authors:

Xia, Zheng [1,2]

Chen, Wei [3]

Chang, Chunqi [3]

Zhou, Xiaobo [1,2]

Affiliations:

[1] Methodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA

[2] Weill Cornell Med Coll, New York, NY 10065 USA

[3] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R China

Source:

INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS | 2013 / Vol. 8 / Issue 3

Keywords:

Laplacian matrix; elastic net; pathway; sparsity; whole solution path; PENALIZED REGRESSION; ALZHEIMERS-DISEASE; VARIABLE SELECTION; PROTEIN; LASSO; REGULARIZATION; GENERATION;

DOI:

10.1504/IJDMB.2013.056081

Chinese Library Classification (CLC) number:

Q [Biological Sciences];

Discipline classification code:

07 ; 0710 ; 09 ;

Abstract:

It is important to incorporate accumulated knowledge of biological pathways and interactions into genome-wide association studies to elucidate correlations between genetic variants and disease. Although a number of methods have recently been developed to identify disease-related genes using prior biological knowledge, most of them only encourage smoothness of the coefficients along the network, which does not address the case where two connected genes both have positive or negative effects on the response. To overcome this issue, we propose applying the Laplacian operation to the absolute values of the coefficients to take account of the positive and negative effects, together with an L1-norm term to impose sparsity. Further, an efficient algorithm is developed to compute the whole solution path. Simulation studies show that the proposed method performs better than network-constrained regularisation without absolute values. Applying our method to microarray data of Alzheimer's disease (AD) identifies several subnetworks on Kyoto Encyclopedia of Genes and Genomes (KEGG) transcriptional pathways that are related to the progression of AD. Many of those findings are confirmed by published literature.
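
The penalty described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything here is an assumption made for illustration: the use of the symmetric normalized Laplacian, the function names `normalized_laplacian` and `network_penalty`, the weights `lam1` and `lam2`, and the two-gene toy network. The sketch only evaluates the penalty term; it does not implement the authors' solution-path algorithm.

```python
import numpy as np


def normalized_laplacian(adj):
    """Symmetric normalized Laplacian L = I - D^{-1/2} A D^{-1/2}.

    Assumption: the paper's Laplacian may be defined differently.
    Isolated nodes (degree 0) get an all-zero row and column.
    """
    adj = np.asarray(adj, dtype=float)
    deg = adj.sum(axis=1)
    connected = deg > 0
    inv_sqrt = np.zeros_like(deg)
    inv_sqrt[connected] = 1.0 / np.sqrt(deg[connected])
    scaled = inv_sqrt[:, None] * adj * inv_sqrt[None, :]
    return np.diag(connected.astype(float)) - scaled


def network_penalty(beta, lap, lam1, lam2, use_abs=True):
    """lam1 * ||beta||_1 + lam2 * b^T L b.

    With use_abs=True, b = |beta| (Laplacian applied to absolute values);
    with use_abs=False, b = beta (plain network smoothness).
    """
    beta = np.asarray(beta, dtype=float)
    b = np.abs(beta) if use_abs else beta
    return lam1 * np.abs(beta).sum() + lam2 * (b @ lap @ b)


# Hypothetical toy network: two connected genes whose coefficients
# have equal magnitude but opposite signs.
adj = np.array([[0, 1],
                [1, 0]])
beta = np.array([1.0, -1.0])
lap = normalized_laplacian(adj)

with_abs = network_penalty(beta, lap, lam1=0.5, lam2=1.0, use_abs=True)
without_abs = network_penalty(beta, lap, lam1=0.5, lam2=1.0, use_abs=False)
print(with_abs, without_abs)
```

In this toy case the smoothness term vanishes when the Laplacian acts on the absolute values, because the two magnitudes are equal, whereas the plain quadratic form penalizes the sign difference. Only the L1 term remains in the first value.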


Pages: 311-325

Page count: 15
