Analysis of gene expression data using functional principal components

Cited by: 7
Author
Barra, V [1]
Affiliation
[1] CNRS, LIMOS, UMR 6158, F-63117 Aubiere, France
Keywords
DNA; gene; functional principal components; microarray
DOI
10.1016/j.cmpb.2003.08.006
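Chinese Library Classification number
TP39 [Computer applications]
Subject classification code
081203 [Computer application technology]; 0835 [Software engineering]
Abstract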
The large amount of data produced by DNA microarrays calls for efficient computer algorithms to analyze gene expression, and thus to study the transcriptome. Numerous techniques already exist; we propose a new method based on the key idea that gene profiles may be considered as continuous curves. The set of curves stemming from a DNA microarray may then be analyzed with a functional analysis that exhibits the main modes of variation in the set, groups genes with similar variations, and extracts characteristic parameters of gene profiles. We introduce this method, called Functional Principal Component Analysis. A prospective study was performed on two available datasets: the sporulation data of Saccharomyces cerevisiae and data from tumor cell lines. Results are very promising: the method is able to extract characteristic parameters from the datasets, to extract significant modes of variation in the set of gene profiles, and to link these variations to biological processes already studied in the literature. (C) 2003 Elsevier Ireland Ltd. All rights reserved.
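The idea of treating gene profiles as curves and extracting their main modes of variation can be illustrated with a minimal sketch. This is not the author's implementation: it assumes all profiles are sampled on one common time grid and replaces any basis-function smoothing with a simple trapezoidal discretization of the covariance operator. The function name `fpca`, the time grid, and the synthetic profiles are hypothetical and serve only as an illustration.

```python
import numpy as np

def fpca(profiles, times, n_components=2):
    """Discretized functional PCA for curves sampled on a common grid.

    profiles: array of shape (n_genes, n_times), one expression profile per row.
    times:    array of shape (n_times,), strictly increasing sampling times.
    """
    profiles = np.asarray(profiles, dtype=float)
    times = np.asarray(times, dtype=float)

    # Trapezoidal quadrature weights approximate integrals over time.
    w = np.empty_like(times)
    w[0] = (times[1] - times[0]) / 2.0
    w[-1] = (times[-1] - times[-2]) / 2.0
    w[1:-1] = (times[2:] - times[:-2]) / 2.0

    mean_curve = profiles.mean(axis=0)
    centered = profiles - mean_curve

    # Sample covariance of the curves on the grid.
    cov = centered.T @ centered / (profiles.shape[0] - 1)

    # Symmetrize the weighted eigenproblem: W^(1/2) C W^(1/2) u = lambda u.
    sqrt_w = np.sqrt(w)
    sym = sqrt_w[:, None] * cov * sqrt_w[None, :]
    eigvals, eigvecs = np.linalg.eigh(sym)

    # Keep the leading components, largest eigenvalue first.
    order = np.argsort(eigvals)[::-1][:n_components]
    eigvals = eigvals[order]

    # Map back to eigenfunctions with unit norm under the quadrature weights.
    harmonics = (eigvecs[:, order] / sqrt_w[:, None]).T

    # Scores are weighted inner products of each centered curve with each eigenfunction.
    scores = (centered * w) @ harmonics.T
    return mean_curve, eigvals, harmonics, scores

# Synthetic example (hypothetical data): 50 profiles on an uneven 7-point grid.
rng = np.random.default_rng(0)
times = np.array([0.0, 0.5, 2.0, 5.0, 7.0, 9.0, 11.5])
a = rng.normal(size=(50, 1))
b = rng.normal(scale=0.5, size=(50, 1))
profiles = a * np.sin(times / 4.0) + b * np.cos(times / 4.0)
profiles += rng.normal(scale=0.05, size=profiles.shape)

mean_curve, eigvals, harmonics, scores = fpca(profiles, times, n_components=2)
```

Here `harmonics` holds the estimated modes of variation and `scores` gives each gene's coordinates on them, which could then be used to group genes with similar profiles.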
Pages: 1-9
Number of pages: 9