Exploring relationships in gene expressions: A partial least squares approach

被引:42
作者
Datta, S [1 ]
机构
[1] Georgia State Univ, Dept Math & Stat, Atlanta, GA 30303 USA
来源
GENE EXPRESSION | 2001年 / 9卷 / 06期
关键词
microarray; gene expression data; partial least squares; yeast; modeling;
D O I
10.3727/000000001783992498
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Microarray technology has revolutionized the way gene functions are monitored. Analysis of microarray data is a fast growing research area that interfaces various disciplines such as biology, biochemistry, computer science, and statistics. While various clustering and classification techniques have been successfully employed to group genes based on the similarity of their expression patterns, much is yet to be learned about the interrelationship of the expression levels among various genes. We approach this problem with a statistical technique called partial least squares that is capable of modeling a large number of variables each with relatively few observations. This property of the partial least squares methodology appears to be attractive for application to microarray data sets where the simultaneous expression levels of many genes are collected each at a few time points (or individuals). We use it to analyze publicly available microarray data on sporulation of budding yeast (Saccharomyces cerevisiae). We investigate a number of representative genes, one from each temporal group (based on the time of first induction) of positively expressed genes and show that in each case most of the variability was explained by only two partial regression terms based on all remaining genes. Moreover, the predicted expression levels of the representative genes from partial least squares fit very well on the average with the true expression levels over time. Finally, we compare the biological functions of the genes with largest coefficients with those of the predicted genes. In many cases, the genes are involved in similar or related biological functions including negative relationships. We show that this method can identify established gene relationships; we argue that it can be an exploratory tool for identifying potential gene relationships requiring further biological investigation.
引用
收藏
页码:249 / 255
页数:7
相关论文
共 11 条
[1]   Gene expression data analysis [J].
Brazma, A ;
Vilo, J .
FEBS LETTERS, 2000, 480 (01) :17-24
[2]   Knowledge-based analysis of microarray gene expression data by using support vector machines [J].
Brown, MPS ;
Grundy, WN ;
Lin, D ;
Cristianini, N ;
Sugnet, CW ;
Furey, TS ;
Ares, M ;
Haussler, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) :262-267
[3]  
Brown P. J., 1993, MEASUREMENT REGRESSI
[4]   The transcriptional program of sporulation in budding yeast [J].
Chu, S ;
DeRisi, J ;
Eisen, M ;
Mulholland, J ;
Botstein, D ;
Brown, PO ;
Herskowitz, I .
SCIENCE, 1998, 282 (5389) :699-705
[5]   Exploring the metabolic and genetic control of gene expression on a genomic scale [J].
DeRisi, JL ;
Iyer, VR ;
Brown, PO .
SCIENCE, 1997, 278 (5338) :680-686
[6]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[7]   CLN1 and its repression by Xbp1 are important for efficient sporulation in budding yeast [J].
Mai, B ;
Breeden, L .
MOLECULAR AND CELLULAR BIOLOGY, 2000, 20 (02) :478-487
[8]  
STONE M, 1990, J ROY STAT SOC B MET, V52, P237
[9]  
STONE M, 1992, J ROY STAT SOC B MET, V54, P906
[10]   Analysis of gene expression data using self-organizing maps [J].
Törönen, P ;
Kolehmainen, M ;
Wong, C ;
Castrén, E .
FEBS LETTERS, 1999, 451 (02) :142-146