On the difference between low-rank and subspace approximation: improved model for multi-linear PLS regression

被引:59
作者
Bro, R [1 ]
Smilde, AK
de Jong, S
机构
[1] Royal Vet & Agr Univ, Dept Dairy & Food Sci, Chemometr Grp, DK-1958 Frederiksberg, Denmark
[2] Univ Amsterdam, Dept Chem Engn, NL-1018 WV Amsterdam, Netherlands
[3] Unilever Res Labs Vlaardingen, NL-3130 AC Vlaardingen, Netherlands
关键词
partial least squares; multi-way calibration; PARAFAC; tucker3;
D O I
10.1016/S0169-7439(01)00134-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While both Tucker3 and PARAFAC models can be viewed as latent variable models extending principal component analysis (PCA) to multi-way data, most fundamental properties of PCA do not extend to both models. This has practical importance, which will be explained in this paper. The fundamental difference between the PARAFAC and the Tucker3 model can be viewed as the difference between so-called low-rank and subspace approximation of the data. This insight is used to pose a modification of the multi-linear partial least squares regression (N-PLS) model. The modification is found by exploiting the basic properties of PLS and of multi-way models. Compared to the current prevalent implementation of N-PLS, the new model provides a more reasonable fit to the independent data and exactly the same predictions of the dependent variables. Thus, the reason for introducing this improved model is not to obtain better predictions, but rather the aim is to improve the secondary aspect of PLS: the modeling of the independent variables. The original version of N-PLS has some built-in problems that are easily circumvented with the modification suggested here. This is of importance, for example, in process monitoring, outlier detection and also, implicitly, for jackknifing of model parameters. Some examples are provided to illustrate some of these points. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:3 / 13
页数:11
相关论文
共 39 条
[1]   The N-way Toolbox for MATLAB [J].
Andersson, CA ;
Bro, R .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 52 (01) :1-4
[2]  
[Anonymous], 1971, GEN INVERSES MATRICE
[3]   Second-order data by flow injection analysis with spectrophotometric diode-array detection and incorporated gel-filtration chromatographic column [J].
Bechmann, IE .
TALANTA, 1997, 44 (04) :585-591
[4]   Monitoring and diagnosing batch processes with multiway covariates regression models [J].
Boqué, R ;
Smilde, AK .
AICHE JOURNAL, 1999, 45 (07) :1504-1520
[5]   PARAFAC. Tutorial and applications [J].
Bro, R .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1997, 38 (02) :149-171
[6]   Exploratory study of sugar production using fluorescence spectroscopy and multi-way analysis [J].
Bro, R .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 46 (02) :133-147
[7]   Enzymatic browning of vegetables. Calibration and analysis of variance by multiway methods [J].
Bro, R ;
Heimdal, H .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1996, 34 (01) :85-102
[8]  
Bro R, 1996, J CHEMOMETR, V10, P47, DOI 10.1002/(SICI)1099-128X(199601)10:1<47::AID-CEM400>3.3.CO
[9]  
2-3
[10]  
BRO R, 1998, THESIS U AMSTERDAM