Gene Association Networks from Microarray Data Using a Regularized Estimation of Partial Correlation Based on PLS Regression

被引:26
作者
Tenenhaus, Arthur [1 ]
Guillemot, Vincent [1 ,2 ]
Gidrol, Xavier [1 ]
Frouin, Vincent [1 ]
机构
[1] CEA, Inst Radiobiol Cellulaire & Mol, Lab Explorat Fonct Genomes, F-91000 Evry, France
[2] Supelec, Dept Signal & Elect Syst, F-91192 Gif Sur Yvette, France
关键词
Gene association networks; partial correlation; high-dimensional data; Partial Least Squares Regression; local false discovery rate; KERNEL ALGORITHM; CAUSAL NETWORKS; DATA SETS; C-MYC; EXPRESSION; SHRINKAGE; SELECTION; VALIDATION; DISCOVERY; VARIABLES;
D O I
10.1109/TCBB.2008.87
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Reconstruction of gene-gene interactions from large-scale data such as microarrays is a first step toward better understanding the mechanisms at work in the cell. Two main issues have to be managed in such a context: 1) choosing which measures have to be used to distinguish between direct and indirect interactions from high-dimensional microarray data and 2) constructing networks with a low proportion of false-positive edges. We present an efficient methodology for the reconstruction of gene interaction networks in a small-sample-size setting. The strength of independence of any two genes is measured, in such "high-dimensional network," by a regularized estimation of partial correlation based on Partial Least Squares Regression. We finally emphasize specific properties of the proposed method. To assess the sensitivity and specificity of the method, we carried out the reconstruction of networks from simulated data. We also tested PLS-based partial correlation network on static and dynamic real microarray data. An R implementation of the proposed algorithm is available from http://biodev.extra.cea.fr/plspcnetwork/.
引用
收藏
页码:251 / 262
页数:12
相关论文
共 54 条
[1]
[Anonymous], MOL SYSTEMS BIOL
[2]
[Anonymous], 2002, Journal of machine learning research
[3]
[Anonymous], 2005, Local False Discovery Rates
[4]
Network biology:: Understanding the cell's functional organization [J].
Barabási, AL ;
Oltvai, ZN .
NATURE REVIEWS GENETICS, 2004, 5 (02) :101-U15
[5]
Reverse engineering of regulatory networks in human B cells [J].
Basso, K ;
Margolin, AA ;
Stolovitzky, G ;
Klein, U ;
Dalla-Favera, R ;
Califano, A .
NATURE GENETICS, 2005, 37 (04) :382-390
[6]
CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]
Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks [J].
Butte, AJ ;
Tamayo, P ;
Slonim, D ;
Golub, TR ;
Kohane, IS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (22) :12182-12186
[8]
Castelo R, 2006, J MACH LEARN RES, V7, P2621
[9]
D'haeseleer P, 1998, INFORMATION PROCESSING IN CELLS AND TISSUES, P203
[10]
Comparisons and validation of statistical clustering techniques for microarray gene expression data [J].
Datta, S ;
Datta, S .
BIOINFORMATICS, 2003, 19 (04) :459-466