Biological network inference using low order partial correlation

被引:35
作者
Zuo, Yiming [1 ,2 ]
Yu, Guogiang [2 ]
Tadesse, Mahlet G. [3 ]
Ressom, Habtom W. [1 ]
机构
[1] Georgetown Univ, Lombardi Comprehens Canc Ctr, Washington, DC 20057 USA
[2] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Arlington, VA USA
[3] Georgetown Univ, Dept Math & Stat, Washington, DC 20057 USA
基金
美国国家卫生研究院;
关键词
Systems biology; Undirected network inference; Correlation; Gaussian graphical models; Low order partial correlation; Biomarker discovery; INDEPENDENCE; EXPRESSION; DISCOVERY;
D O I
10.1016/j.ymeth.2014.06.010
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Biological network inference is a major challenge in systems biology. Traditional correlation-based network analysis results in too many spurious edges since correlation cannot distinguish between direct and indirect associations. To address this issue, Gaussian graphical models (GGM) were proposed and have been widely used. Though they can significantly reduce the number of spurious edges, GGM are insufficient to uncover a network structure faithfully due to the fact that they only consider the full order partial correlation. Moreover, when the number of samples is smaller than the number of variables, further technique based on sparse regularization needs to be incorporated into GGM to solve the singular covariance inversion problem. In this paper, we propose an efficient and mathematically solid algorithm that infers biological networks by computing low order partial correlation (LOPC) up to the second order. The bias introduced by the low order constraint is minimal compared to the more reliable approximation of the network structure achieved. In addition, the algorithm is suitable for a dataset with small sample size but large number of variables. Simulation results show that LOPC yields far less spurious edges and works well under various conditions commonly seen in practice. The application to a real metabolomics dataset further validates the performance of LOPC and suggests its potential power in detecting novel biomarkers for complex disease. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:266 / 273
页数:8
相关论文
共 22 条
[1]   Network motifs: theory and experimental approaches [J].
Alon, Uri .
NATURE REVIEWS GENETICS, 2007, 8 (06) :450-461
[2]  
Anderson T. W., 2003, INTRO MULTIVARIATE S, P39
[3]  
[Anonymous], 2002, CAUSE CORRELATION BI, DOI DOI 10.1017/CBO9780511605949
[4]   Partial correlation and conditional correlation as measures of conditional independence [J].
Baba, K ;
Shibata, R ;
Sibuya, M .
AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2004, 46 (04) :657-664
[5]   Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks [J].
Butte, AJ ;
Tamayo, P ;
Slonim, D ;
Golub, TR ;
Kohane, IS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (22) :12182-12186
[6]   A new approach for learning belief networks using independence criteria [J].
de Campos, LM ;
Huete, JF .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2000, 24 (01) :11-37
[7]   Discovery of meaningful associations in genomic data using partial correlation coefficients [J].
de la Fuente, A ;
Bing, N ;
Hoeschele, I ;
Mendes, P .
BIOINFORMATICS, 2004, 20 (18) :3565-3574
[8]  
Edwards D., 2000, INTRO GRAPHICAL MODE
[9]  
Fisher RA, 1914, BIOMETRIKA, V10, P507
[10]   Sparse inverse covariance estimation with the graphical lasso [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Robert .
BIOSTATISTICS, 2008, 9 (03) :432-441