Inference of a genetic network by a combined approach of cluster analysis and graphical Gaussian modeling

Cited by: 136
Authors
Toh, H
Horimoto, K
Affiliations
[1] Biomol Engn Res Inst, Dept Bioinformat, Suita, Osaka 5650874, Japan
[2] Saga Med Sch, Math Lab, Saga 8498501, Japan
DOI
10.1093/bioinformatics/18.2.287
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: Recent advances in DNA microarray technologies have made it possible to measure the expression levels of thousands of genes simultaneously under different conditions. The data obtained by microarray analyses are called expression profile data. One important type of information underlying the expression profile data is the 'genetic network', that is, the regulatory network among genes. Graphical Gaussian Modeling (GGM) is a widely used method for inferring or testing relationships among multiple variables.
Results: In this study, we developed a method that combines cluster analysis with GGM to infer the genetic network from expression profile data. The expression profile data of 2467 Saccharomyces cerevisiae genes measured under 79 different conditions (Eisen et al., Proc. Natl Acad. Sci. USA, 95, 14863-14868, 1998) were used for this study. First, the 2467 genes were classified into 34 clusters by a cluster analysis, as a preprocessing step for GGM. Then, the expression levels of the genes in each cluster were averaged for each condition. The averaged expression profile data of the 34 clusters were subjected to GGM, and a partial correlation coefficient matrix was obtained as a model of the genetic network of S. cerevisiae. The accuracy of the inferred network was examined by comparing our results with the cumulative results of experimental studies.
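The two numerical steps named in the abstract, averaging expression levels within each cluster and deriving a partial correlation coefficient matrix, can be illustrated with a minimal sketch. This is not the authors' implementation: the data are synthetic, the cluster assignment is an arbitrary placeholder rather than the output of a cluster analysis, the function names `cluster_average` and `partial_correlations` are hypothetical, and the partial correlations are taken directly from the inverse of the correlation matrix without any model-selection step that a full GGM procedure may include.

```python
import numpy as np

def cluster_average(expr, labels):
    """Average the rows (genes) of `expr` within each cluster label.

    Returns an array of shape (n_clusters, n_conditions)."""
    clusters = np.unique(labels)
    return np.vstack([expr[labels == c].mean(axis=0) for c in clusters])

def partial_correlations(profiles):
    """Partial correlation matrix between the rows of `profiles`,
    computed from the inverse of their correlation matrix."""
    corr = np.corrcoef(profiles)        # rows are treated as variables
    prec = np.linalg.inv(corr)          # inverse correlation matrix
    d = np.sqrt(np.diag(prec))
    pcor = -prec / np.outer(d, d)       # rescale and flip the sign
    np.fill_diagonal(pcor, 1.0)
    return pcor

# Synthetic stand-in for expression profile data (assumption, not real data).
rng = np.random.default_rng(0)
n_genes, n_conditions, n_clusters = 60, 79, 5
labels = np.arange(n_genes) % n_clusters   # placeholder cluster assignment
expr = rng.normal(size=(n_genes, n_conditions))

profiles = cluster_average(expr, labels)   # one averaged profile per cluster
pcor = partial_correlations(profiles)      # cluster-by-cluster matrix
```

In this sketch an off-diagonal entry of `pcor` close to zero suggests that two cluster profiles are conditionally independent given the others, which is the kind of relationship a graphical Gaussian model encodes as a missing edge.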
Pages: 287-297
Page count: 11
Related papers
19 in total
[1]   Algorithms for identifying Boolean networks and related biological networks based on matrix multiplication and fingerprint function [J].
Akutsu, T ;
Miyano, S ;
Kuhara, S .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :331-343
[2]   Clustering gene expression patterns [J].
Ben-Dor, A ;
Shamir, R ;
Yakhini, Z .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :281-297
[3]  
Chen T., 1999, P PAC S BIOC, P17
[4]  
D'haeseleer P, 1999, Pac Symp Biocomput, P41
[5]  
EDWARDS D, 1995, INTRO GRAPHICAL MODE
[6]   A LEISURELY LOOK AT THE BOOTSTRAP, THE JACKKNIFE, AND CROSS-VALIDATION [J].
EFRON, B ;
GONG, G .
AMERICAN STATISTICIAN, 1983, 37 (01) :36-48
[7]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[8]   Using Bayesian networks to analyze expression data [J].
Friedman, N ;
Linial, M ;
Nachman, I ;
Pe'er, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :601-620
[9]  
HORIMOTO K, 2001, IN PRESS BIOINFOMATI
[10]   YEAST GLOBAL TRANSCRIPTIONAL REGULATORS SIN4 AND RGR1 ARE COMPONENTS OF MEDIATOR COMPLEX RNA-POLYMERASE-II HOLOENZYME [J].
LI, Y ;
BJORKLUND, S ;
JIANG, YW ;
KIM, YJ ;
LANE, WS ;
STILLMAN, DJ ;
KORNBERG, RD .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (24) :10864-10868