A modification of canonical variates analysis to handle highly collinear multivariate data

被引:75
作者
Norgaard, Lars
Bro, Rasmus
Westad, Frank
Engelsen, Soren Balling
机构
[1] Royal Vet & Agr Univ, Chemometr Grp, Dept Food Sci Qual & Technol, DK-1958 Frederiksberg C, Denmark
[2] Norwegian Food Res Inst, N-1430 As, Norway
关键词
canonical variates; discriminant analysis; classification; collinear; spectroscopy;
D O I
10.1002/cem.1017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A modification of the standard Canonical Variates Analysis (CVA) method to cope with collinear high-dimensional data is developed. The method utilizes Partial Least Squares regression as an engine for solving an eigenvector problem involving singular covariance matrices. Three data sets are analyzed to demonstrate the properties of the method: a two-group problem with near infrared spectroscopic data consisting of 60 samples and 376 variables, a multi-group problem with fluorescence spectroscopic data (1023 variables) consisting of 83 samples from six groups and a three-group problem with physical-chemical data (10 variables) consisting of 41 samples from three groups. It is demonstrated that the modified CVA method forces the discriminative information into the first canonical variates as expected. The weight vectors found in the modified CVA method possess the same properties as weight vectors of the standard CVA method. By combination of the suggested method with, for example, Linear Discriminant Analysis (LDA) as a classifier, an operational tool for classification and discrimination of collinear data is obtained. Copyright (c) 2007 John Wiley & Sons, Ltd.
引用
收藏
页码:425 / 435
页数:11
相关论文
共 24 条
[1]  
[Anonymous], 2000, PRINCIPLES MULTIVARI
[2]   Partial least squares for discrimination [J].
Barker, M ;
Rayens, W .
JOURNAL OF CHEMOMETRICS, 2003, 17 (03) :166-173
[3]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[4]  
FRANK I E, 1989, Journal of Chemometrics, V3, P463, DOI 10.1002/cem.1180030304
[5]  
Hart, 2006, PATTERN CLASSIFICATI
[6]   Analysis of a complex of statistical variables into principal components [J].
Hotelling, H .
JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1933, 24 :498-520
[7]   Multivariate strategies for classification based on NIR-spectra -: with application to mayonnaise [J].
Indahl, UG ;
Sahni, NS ;
Kirkhus, B ;
Næs, T .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 49 (01) :19-31
[8]   Principal discriminant variate method for classification of multicollinear data: Applications to near-infrared spectra of cow blood samples [J].
Jiang, JH ;
Tsenkova, R ;
Wu, YQ ;
Yu, RQ ;
Ozaki, Y .
APPLIED SPECTROSCOPY, 2002, 56 (04) :488-501
[9]  
Jonathan P, 1996, J CHEMOMETR, V10, P189, DOI 10.1002/(SICI)1099-128X(199605)10:3<189::AID-CEM410>3.0.CO
[10]  
2-I