On the difference between low-rank and subspace approximation: improved model for multi-linear PLS regression

被引:59
作者
Bro, R [1 ]
Smilde, AK
de Jong, S
机构
[1] Royal Vet & Agr Univ, Dept Dairy & Food Sci, Chemometr Grp, DK-1958 Frederiksberg, Denmark
[2] Univ Amsterdam, Dept Chem Engn, NL-1018 WV Amsterdam, Netherlands
[3] Unilever Res Labs Vlaardingen, NL-3130 AC Vlaardingen, Netherlands
关键词
partial least squares; multi-way calibration; PARAFAC; tucker3;
D O I
10.1016/S0169-7439(01)00134-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While both Tucker3 and PARAFAC models can be viewed as latent variable models extending principal component analysis (PCA) to multi-way data, most fundamental properties of PCA do not extend to both models. This has practical importance, which will be explained in this paper. The fundamental difference between the PARAFAC and the Tucker3 model can be viewed as the difference between so-called low-rank and subspace approximation of the data. This insight is used to pose a modification of the multi-linear partial least squares regression (N-PLS) model. The modification is found by exploiting the basic properties of PLS and of multi-way models. Compared to the current prevalent implementation of N-PLS, the new model provides a more reasonable fit to the independent data and exactly the same predictions of the dependent variables. Thus, the reason for introducing this improved model is not to obtain better predictions, but rather the aim is to improve the secondary aspect of PLS: the modeling of the independent variables. The original version of N-PLS has some built-in problems that are easily circumvented with the modification suggested here. This is of importance, for example, in process monitoring, outlier detection and also, implicitly, for jackknifing of model parameters. Some examples are provided to illustrate some of these points. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:3 / 13
页数:11
相关论文
共 39 条
[21]  
2-I
[22]  
KROONENBERG PM, 1983, 3 MODE PRINCIPAL COM
[23]  
Kruskal J.B., 1989, MULTIWAY DATA ANAL, P8
[24]   GENERATING VOCAL-TRACT SHAPES FROM FORMANT FREQUENCIES [J].
LADEFOGED, P ;
HARSHMAN, R ;
GOLDSTEIN, L ;
RICE, L .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 (04) :1027-1035
[25]  
Marsaglia G., 1974, LINEAR MULTILINEAR A, V2, P269, DOI DOI 10.1080/03081087408817070
[26]   Modified Jack-knife estimation of parameter uncertainty in bilinear modelling by partial least squares regression (PLSR) [J].
Martens, H ;
Martens, M .
FOOD QUALITY AND PREFERENCE, 2000, 11 (1-2) :5-16
[27]   A multiway 3D QSAR analysis of a series of (S)-N-[(1-ethyl-2-pyrrolidinyl)methyl]-6-methoxybenzamides [J].
Nilsson, J ;
Homan, EJ ;
Smilde, AK ;
Grol, CJ ;
Wikstrom, H .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1998, 12 (01) :81-93
[28]  
Nilsson J, 1997, J CHEMOMETR, V11, P511
[29]  
NORGAARD L, 1999, METHODE PLS, P187
[30]  
OSSENKOPP KP, 1985, NEUROBEH TOXICOL TER, V7, P95