Unbiased pattern detection in microarray data series

被引:19
作者
Ahnert, S. E. [1 ]
Willbrand, K.
Brown, F. C. S.
Fink, T. M. A.
机构
[1] Univ Cambridge, Cavendish Lab, Cambridge CB3 0HE, England
[2] Ecole Normale Super, Lab Phys Stat, F-75231 Paris 05, France
[3] Ecole Normale Super, Dept Math & Applicat, F-75231 Paris 05, France
[4] Inst Curie, CNRS, UMR 144, F-75248 Paris 05, France
关键词
D O I
10.1093/bioinformatics/btl121
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Following the advent of microarray technology in recent years, the challenge for biologists is to identify genes of interest from the thousands of genetic expression levels measured in each microarray experiment. In many cases the aim is to identify pattern in the data series generated by successive microarray measurements. Results: Here we introduce a new method of detecting pattern in microarray data series which is independent of the nature of this pattern. Our approach provides a measure of the algorithmic compressibility of each data series. A series which is significantly compressible is much more likely to result from simple underlying mechanisms than series which are incompressible. Accordingly, the gene associated with a compressible series is more likely to be biologically significant. We test our method on microarray time series of yeast cell cycle and show that it blindly selects genes exhibiting the expected cyclic behaviour as well as detecting other forms of pattern. Our results successfully predict two independent non-microarray experimental studies.
引用
收藏
页码:1471 / 1476
页数:6
相关论文
共 16 条
[1]  
[Anonymous], 2002, NAT GENET S, V32, P461
[2]   Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes [J].
Bar-Joseph, Z ;
Gerber, G ;
Simon, L ;
Gifford, DK ;
Jaakkola, TS ;
Jaakkola, TS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (18) :10146-10151
[3]   ON LENGTH OF PROGRAMS FOR COMPUTING FINITE BINARY SEQUENCES [J].
CHAITIN, GJ .
JOURNAL OF THE ACM, 1966, 13 (04) :547-+
[4]   A genome-wide transcriptional analysis of the mitotic cell cycle [J].
Cho, RJ ;
Campbell, MJ ;
Winzeler, EA ;
Steinmetz, L ;
Conway, A ;
Wodicka, L ;
Wolfsberg, TG ;
Gabrielian, AE ;
Landsman, D ;
Lockhart, DJ ;
Davis, RW .
MOLECULAR CELL, 1998, 2 (01) :65-73
[5]  
Cover TM, 2006, Elements of Information Theory
[6]  
Kolmogorov A., 1965, PROBL PEREDACHI INF, V1, P4
[7]   Yeast Protein Database (YPD): A database for the complete proteome of Saccharomyces cerevisiae [J].
Payne, WE ;
Garrels, JI .
NUCLEIC ACIDS RESEARCH, 1997, 25 (01) :57-62
[8]   A MATHEMATICAL THEORY OF COMMUNICATION [J].
SHANNON, CE .
BELL SYSTEM TECHNICAL JOURNAL, 1948, 27 (03) :379-423
[9]   Serial regulation of transcriptional regulators in the yeast cell cycle [J].
Simon, I ;
Barnett, J ;
Hannett, N ;
Harbison, CT ;
Rinaldi, NJ ;
Volkert, TL ;
Wyrick, JJ ;
Zeitlinger, J ;
Gifford, DK ;
Jaakkola, TS ;
Young, RA .
CELL, 2001, 106 (06) :697-708
[10]   Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization [J].
Spellman, PT ;
Sherlock, G ;
Zhang, MQ ;
Iyer, VR ;
Anders, K ;
Eisen, MB ;
Brown, PO ;
Botstein, D ;
Futcher, B .
MOLECULAR BIOLOGY OF THE CELL, 1998, 9 (12) :3273-3297