Statistical modeling of large microarray data sets to identify stimulus-response profiles

被引:108
作者
Zhao, LP
Prentice, R
Breeden, L
机构
[1] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98006 USA
[2] Fred Hutchinson Canc Res Ctr, Div Basic Sci, Seattle, WA 98006 USA
关键词
D O I
10.1073/pnas.101013198
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A statistical modeling approach is proposed for use in searching large microarray data sets for genes that have a transcriptional response to a stimulus. The approach is unrestricted with respect to the timing, magnitude or duration of the response, or the overall abundance of the transcript. The statistical model makes an accommodation for systematic heterogeneity in expression levels. Corresponding data analyses provide gene-specific information. and the approach provides a means for evaluating the statistical significance of such information. To illustrate this strategy we have derived a model to depict the profile expected for a periodically transcribed gene and used it to look for budding yeast transcripts that adhere to this profile. Using objective criteria, this method identifies 81% of the known periodic transcripts and 1,088 genes, which show significant periodicity in at least one of the three data sets analyzed. However, only one-quarter of these genes show significant oscillations in at least two data sets and can be classified as periodic with high confidence. The method provides estimates of the mean activation and deactivation times, induced and basal expression levels, and statistical measures of the precision of these estimates for each periodic transcript.
引用
收藏
页码:5631 / 5636
页数:6
相关论文
共 17 条
[1]   Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[2]  
Breeden LL, 1997, METHOD ENZYMOL, V283, P332
[3]   A genome-wide transcriptional analysis of the mitotic cell cycle [J].
Cho, RJ ;
Campbell, MJ ;
Winzeler, EA ;
Steinmetz, L ;
Conway, A ;
Wodicka, L ;
Wolfsberg, TG ;
Gabrielian, AE ;
Landsman, D ;
Lockhart, DJ ;
Davis, RW .
MOLECULAR CELL, 1998, 2 (01) :65-73
[4]   Parallel analysis of genetic selections using whole genome oligonucleotide arrays [J].
Cho, RJ ;
Fromont-Racine, M ;
Wodicka, L ;
Feierbach, B ;
Stearns, T ;
Legrain, P ;
Lockhart, DJ ;
Davis, RW .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (07) :3752-3757
[5]   Exploring the metabolic and genetic control of gene expression on a genomic scale [J].
DeRisi, JL ;
Iyer, VR ;
Brown, PO .
SCIENCE, 1997, 278 (5338) :680-686
[6]   LIGHT-DIRECTED, SPATIALLY ADDRESSABLE PARALLEL CHEMICAL SYNTHESIS [J].
FODOR, SPA ;
READ, JL ;
PIRRUNG, MC ;
STRYER, L ;
LU, AT ;
SOLAS, D .
SCIENCE, 1991, 251 (4995) :767-773
[7]   Exploring expression data: Identification and analysis of coexpressed genes [J].
Heyer, LJ ;
Kruglyak, S ;
Yooseph, S .
GENOME RESEARCH, 1999, 9 (11) :1106-1115
[8]   Fundamental patterns underlying gene expression profiles: Simplicity from complexity [J].
Holter, NS ;
Mitra, M ;
Maritan, A ;
Cieplak, M ;
Banavar, JR ;
Fedoroff, NV .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (15) :8409-8414
[9]   Array of hope [J].
Lander, ES .
NATURE GENETICS, 1999, 21 (Suppl 1) :3-4
[10]   LONGITUDINAL DATA-ANALYSIS USING GENERALIZED LINEAR-MODELS [J].
LIANG, KY ;
ZEGER, SL .
BIOMETRIKA, 1986, 73 (01) :13-22