A multivariate approach applied to microarray data for identification of genes with cell cycle-coupled transcription

Cited by: 72
Authors
Johansson, D [1]
Lindgren, P [1]
Berglund, A [1]
Affiliations
[1] Umea Univ, Dept Chem, Chemometr Res Grp, S-90187 Umea, Sweden
DOI
10.1093/bioinformatics/btg017
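Chinese Library Classification number
Q5 [Biochemistry];
Subject classification code
071010 [Biochemistry and Molecular Biology]; 081704 [Applied Chemistry];
Abstract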
We have analyzed microarray data using a modeling approach based on the multivariate statistical method partial least squares (PLS) regression to identify genes with periodic fluctuations in expression levels coupled to the cell cycle in the budding yeast, Saccharomyces cerevisiae. PLS has major advantages for analyzing microarray data since it can model data sets with large numbers of variables and with few observations. A response model was derived describing the expression profile over time expected for periodically transcribed genes, and was used to identify budding yeast transcripts with similar profiles. PLS was then used to interpret the importance of the variables (genes) for the model, yielding a ranking list of how well the genes fitted the generated model. Application of an appropriate cutoff value, calculated from randomized data, allows the identification of genes whose expression appears to be synchronized with cell cycling. Our approach also provides information about the stage in the cell cycle where their transcription peaks. Three synchronized yeast cell microarray data sets were analyzed, both separately and combined. Cell cycle-coupled periodicity was suggested for 455 of the 6,178 transcripts monitored in the combined data set, at a significance level of 0.5%. Among the candidates, 85% of the known periodic transcripts were included. Analysis of the three data sets separately yielded similar ranking lists, showing that the method is robust.
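The ranking-and-cutoff idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes a sine/cosine response with a known period, uses only the first PLS weight vector (proportional to X^T y for a single response) instead of a full multi-component PLS model, and combines the sine and cosine weights into one score as a simplification. The function names `periodicity_scores` and `permutation_cutoff`, the toy data, and the period are all hypothetical.

```python
import math
import random


def center(values):
    """Subtract the mean so constant offsets do not affect the weights."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def periodicity_scores(profiles, times, period):
    """Return {gene: (score, peak_phase)} against a sine/cosine response.

    For one response y, the first PLS weight vector is proportional to
    X^T y, so each gene's weight is the inner product of its centered,
    scaled profile with y. Merging the two weights into one score is an
    assumption of this sketch, not the published procedure.
    """
    omega = 2.0 * math.pi / period
    y_sin = center([math.sin(omega * t) for t in times])
    y_cos = center([math.cos(omega * t) for t in times])
    result = {}
    for gene, values in profiles.items():
        x = center(values)
        norm = math.sqrt(sum(v * v for v in x)) or 1.0  # guard flat profiles
        x = [v / norm for v in x]
        w_sin = sum(a * b for a, b in zip(x, y_sin))
        w_cos = sum(a * b for a, b in zip(x, y_cos))
        # A profile cos(omega*t - phi) peaks at phase phi = atan2(w_sin, w_cos).
        phase = math.atan2(w_sin, w_cos) % (2.0 * math.pi)
        result[gene] = (math.hypot(w_sin, w_cos), phase)
    return result


def permutation_cutoff(profiles, times, period, alpha=0.005, n_rounds=20, seed=0):
    """Score time-shuffled profiles and return their (1 - alpha) quantile."""
    rng = random.Random(seed)
    null_scores = []
    for _ in range(n_rounds):
        shuffled = {g: rng.sample(v, len(v)) for g, v in profiles.items()}
        shuffled_scores = periodicity_scores(shuffled, times, period)
        null_scores.extend(score for score, _ in shuffled_scores.values())
    null_scores.sort()
    index = min(len(null_scores) - 1,
                math.ceil((1.0 - alpha) * len(null_scores)) - 1)
    return null_scores[index]


# Toy data (hypothetical): 18 evenly spaced samples covering two cycles.
times = list(range(18))
period = 9.0
rng = random.Random(1)
profiles = {
    "periodic": [math.sin(2.0 * math.pi * t / period) for t in times],
    "flat": [1.0] * len(times),
}
for i in range(50):
    profiles[f"noise{i}"] = [rng.gauss(0.0, 1.0) for _ in times]

scores = periodicity_scores(profiles, times, period)
cutoff = permutation_cutoff(profiles, times, period)
candidates = sorted(g for g, (s, _) in scores.items() if s > cutoff)
```

Here `candidates` holds the genes whose score exceeds the cutoff from the randomized data, and the second element of each `scores` entry estimates the phase of the cycle at which expression peaks. The default `alpha=0.005` mirrors the 0.5% significance level quoted in the abstract. A real analysis would fit a complete PLS model on the full expression matrix.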
Pages: 467-473
Page count: 7