Prediction of clinical outcome with microarray data:: a partial least squares discriminant analysis (PLS-DA) approach

被引:488
作者
Pérez-Enciso, M
Tenenhaus, M
机构
[1] INRA, Stn Ameliorat Genet Animaux, F-31326 Castanet Tolosan, France
[2] HEC Sch Management, F-78350 Jouy En Josas, France
关键词
D O I
10.1007/s00439-003-0921-9
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学]; 090102 [作物遗传育种];
摘要
Partial least squares discriminant analysis (PLS-DA) is a partial least squares regression of a set Y of binary variables describing the categories of a categorical variable on a set X of predictor variables. It is a compromise between the usual discriminant analysis and a discriminant analysis on the significant principal components of the predictor variables. This technique is specially suited to deal with a much larger number of predictors than observations and with multicollineality, two of the main problems encountered when analysing microarray expression data. We explore the performance of PLS-DA with published data from breast cancer (Perou et al. 2000). Several such analyses were carried out: (1) before vs after chemotherapy treatment, (2) estrogen receptor positive vs negative tumours, and (3) tumour classification. We found that the performance of PLS-DA was extremely satisfactory in all cases and that the discriminant cDNA clones often had a sound biological interpretation. We conclude that PLS-DA is a powerful yet simple tool for analysing microarray data.
引用
收藏
页码:581 / 592
页数:12
相关论文
共 29 条
[1]
Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[2]
Charpentier AH, 2000, CANCER RES, V60, P5977
[3]
Exploring relationships in gene expressions: A partial least squares approach [J].
Datta, S .
GENE EXPRESSION, 2001, 9 (06) :249-255
[4]
De Bruin A, 1999, CELL ADHES COMMUN, V7, P13
[5]
Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[6]
Eriksson L., 1999, INTRO MULTI MEGAVARI
[7]
A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[8]
Gershon D, 2002, NATURE, V416, P885, DOI 10.1038/416885a
[9]
Good P. I., 2000, PERMUTATION TESTS PR
[10]
Gruvberger S, 2001, CANCER RES, V61, P5979