A novel network and sparsity constraint regression model for functional module identification in genomic data analysis
被引:4
作者:
Xia, Zheng
论文数: 0引用数: 0
h-index: 0
机构:
Methodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA
Weill Cornell Med Coll, New York, NY 10065 USAMethodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA
Xia, Zheng
[1
,2
]
Chen, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R ChinaMethodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA
Chen, Wei
[3
]
Chang, Chunqi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R ChinaMethodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA
Chang, Chunqi
[3
]
Zhou, Xiaobo
论文数: 0引用数: 0
h-index: 0
机构:
Methodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA
Weill Cornell Med Coll, New York, NY 10065 USAMethodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA
Zhou, Xiaobo
[1
,2
]
机构:
[1] Methodist Hosp, Res Inst, Dept Radiol, Houston, TX 77030 USA
[2] Weill Cornell Med Coll, New York, NY 10065 USA
[3] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R China
It is important to incorporate the accumulated biological pathways and interactions knowledge into genome-wide association studies to elucidate correlations between genetic variants and disease. Although a number of methods have been developed recently to identify disease related genes using prior biological knowledge, most methods only encourage the smoothness of the coefficients along the network which does not address the case where two connected genes both have positive or negative effects on the response. To overcome this issue, we propose to apply the Laplacian operation on the absolute values of the coefficients to take account of the positive and negative effects as well as a L-1 norm term to impose sparsity. Further, an efficient algorithm is developed to get the whole solution path. Simulation studies show that the proposed method has better performance than network-constrained regularisation without absolute values. Applying our method on a microarray data of Alzheimer's disease (AD) identifies several subnetworks on Kyoto Encyclopedia of Genes and Genomes (KEGG) transcriptional pathways that are related to progression of AD. Many of those findings are confirmed by published literature.