Identifying periodically expressed transcripts in microarray time series data

被引:248
作者
Wichert, S
Fokianos, K
Strimmer, K
机构
[1] Univ Munich, Dept Stat, D-80539 Munich, Germany
[2] Univ Cyprus, Dept Math & Stat, CY-1678 Nicosia, Cyprus
关键词
D O I
10.1093/bioinformatics/btg364
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Microarray experiments are now routinely used to collect large-scale time series data, for example to monitor gene expression during the cell cycle. Statistical analysis of this data poses many challenges, one being that it is hard to identify correctly the subset of genes with a clear periodic signature. This has lead to a controversial argument with regard to the suitability of both available methods and current microarray data. Methods: We introduce two simple but efficient statistical methods for signal detection and gene selection in gene expression time series data. First, we suggest the average periodogram as an exploratory device for graphical assessment of the presence of periodic transcripts in the data. Second, we describe an exact statistical test to identify periodically expressed genes that allows one to distinguish periodic from purely random processes. This identification method is based on the so-called g-statistic and uses the false discovery rate approach to multiple testing. Results: Using simulated data it is shown that the suggested method is capable of identifying cell-cycle-activated genes in a gene expression data set even if the number of the cyclic genes is very small and regardless the presence of a dominant non-periodic component in the data. Subsequently, we re-examine 12 large microarray time series data sets (in part controversially discussed) from yeast, human fibroblast, human HeLa and bacterial cells. Based on the statistical analysis it is found that a majority of these data sets contained little or no statistical significant evidence for genes with periodic variation linked to cell cycle regulation. On the other hand, for the remaining data the method extends the catalog of previously known cell-cycle-specific transcripts by identifying additional periodic genes not found by other methods. The problem of distinguishing periodicity due to generic cell cycle activity and to artifacts from synchronization is also discussed.
引用
收藏
页码:5 / 20
页数:16
相关论文
共 14 条
[1]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[2]   Transcriptional regulation and function during the human cell cycle [J].
Cho, RJ ;
Huang, MX ;
Campbell, MJ ;
Dong, HL ;
Steinmetz, L ;
Sapinoso, L ;
Hampton, G ;
Elledge, SJ ;
Davis, RW ;
Lockhart, DJ .
NATURE GENETICS, 2001, 27 (01) :48-54
[3]   Reappraisal of serum starvation, the restriction point, G0, and G1 phase arrest points [J].
Cooper, S .
FASEB JOURNAL, 2003, 17 (03) :333-340
[4]   Correspondence analysis applied to microarray data [J].
Fellenberg, K ;
Hauser, NC ;
Brors, B ;
Neutzner, A ;
Hoheisel, JD ;
Vingron, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (19) :10781-10786
[6]   Global analysis of the genetic network controlling a bacterial cell cycle [J].
Laub, MT ;
McAdams, TH ;
Feldblyum, T ;
Fraser, CM ;
Shapiro, L .
SCIENCE, 2000, 290 (5499) :2144-2148
[7]  
PRIESTLEY MB, 1981, SPECTRAL ANAL TIME S, V1
[8]   Geometry of gene expression dynamics [J].
Rifkin, SA ;
Kim, J .
BIOINFORMATICS, 2002, 18 (09) :1176-1183
[9]   Analysis of cell-cycle gene expression in Saccharomyces cerevisiae using microarrays and multiple synchronization methods [J].
Shedden, K ;
Cooper, S .
NUCLEIC ACIDS RESEARCH, 2002, 30 (13) :2920-2929
[10]   Analysis of cell-cycle-specific gene expression in human cells as determined by microarrays and double-thymidine block synchronization [J].
Shedden, K ;
Cooper, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (07) :4379-4384