Analyzing time series gene expression data

被引:296
作者
Bar-Joseph, Z [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15217 USA
关键词
D O I
10.1093/bioinformatics/bth283
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Time series expression experiments are an increasingly popular method for studying a wide range of biological systems. However, when analyzing these experiments researchers face many new computational challenges. Algorithms that are specifically designed for time series experiments are required so that we can take advantage of their unique features (such as the ability to infer causality from the temporal response pattern) and address the unique problems they raise (e.g. handling the different non-uniform sampling rates). Results: We present a comprehensive review of the current research in time series expression data analysis. We divide the computational challenges into four analysis levels: experimental design, data analysis, pattern recognition and networks. For each of these levels, we discuss computational and biological problems at that level and point out some of the methods that have been proposed to deal with these issues. Many open problems in all these levels are discussed. This review is intended to serve as both, a point of reference for experimental biologists looking for practical solutions for analyzing their data, and a starting point for computer scientists interested in working on the computational problems related to time series expression analysis.
引用
收藏
页码:2493 / 2503
页数:11
相关论文
共 53 条
[31]   Transcriptional profiling shows that Gcn4p is a master regulator of gene expression during amino acid starvation in yeast [J].
Natarajan, K ;
Meyer, MR ;
Jackson, BM ;
Slade, D ;
Roberts, C ;
Hinnebusch, AG ;
Marton, MJ .
MOLECULAR AND CELLULAR BIOLOGY, 2001, 21 (13) :4347-4368
[32]   Human macrophage activation programs induced by bacterial pathogens [J].
Nau, GJ ;
Richmond, JFL ;
Schlesinger, A ;
Jennings, EG ;
Lander, ES ;
Young, RA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (03) :1503-1508
[33]   Coordinated transcription of key pathways in the mouse by the circadian clock [J].
Panda, S ;
Antoch, MP ;
Miller, BH ;
Su, AI ;
Schook, AB ;
Straume, M ;
Schultz, PG ;
Kay, SA ;
Takahashi, JS ;
Hogenesch, JB .
CELL, 2002, 109 (03) :307-320
[34]  
Pe'er D, 2001, Bioinformatics, V17 Suppl 1, pS215
[35]   Gene networks inference using dynamic Bayesian networks [J].
Perrin, Bruno-Edouard ;
Ralaivola, Liva ;
Mazurie, Aurelien ;
Bottani, Samuele ;
Mallet, Jacques ;
d'Alche-Buc, Florence .
BIOINFORMATICS, 2003, 19 :II138-II148
[36]   Conserved homeodomain proteins interact with MADS box protein Mcm1 to restrict ECB-dependent transcription to the M/G1 phase of the cell cycle [J].
Pramila, T ;
Miles, S ;
GuhaThakurta, D ;
Jemiolo, D ;
Breeden, LL .
GENES & DEVELOPMENT, 2002, 16 (23) :3034-3045
[37]   Beyond synexpression relationships: Local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions [J].
Qian, J ;
Dolled-Filhart, M ;
Lin, J ;
Yu, HY ;
Gerstein, M .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 314 (05) :1053-1066
[38]   Cluster analysis of gene expression dynamics [J].
Ramoni, MF ;
Sebatiani, P ;
Kohane, IS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (14) :9121-9126
[39]   Using hidden Markov models to analyze gene expression time course data [J].
Schliep, Alexander ;
Schoenhuth, Alexander ;
Steinhoff, Christine .
BIOINFORMATICS, 2003, 19 :i255-i263
[40]  
Sharan R, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P307