THE PARSIMONY PRINCIPLE APPLIED TO MULTIVARIATE CALIBRATION

被引:182
作者
SEASHOLTZ, MB [1 ]
KOWALSKI, B [1 ]
机构
[1] UNIV WASHINGTON,CHEMOMETR LAB,BG-10,SEATTLE,WA 98195
关键词
MULTIVARIATE CALIBRATION; PARTIAL LEAST SQUARES; PRINCIPAL COMPONENT ANALYSIS; CHEMOMETRICS; MODEL SELECTION; MULTIVARIATE ADAPTIVE REPRESSION SPLINES; NEURAL NETWORKS; PARSIMONY PRINCIPLE; PROJECTION PURSUIT REGRESSION;
D O I
10.1016/0003-2670(93)80430-S
中图分类号
O65 [分析化学];
学科分类号
070302 [分析化学]; 081704 [应用化学];
摘要
The general principle of parsimonious data modeling states that if two models in some way adequately model a given set of data, the one that is described by a fewer number of parameters will have better predictive ability given new data. This concept is of interest in multivariate calibration since several new non-linear modeling techniques have become available. Three such methods are neural networks, projection pursuit regression (PPR) and multivariate adaptive regression splines (MARS). These methods, while capable of modeling non-linearities, typically have very many parameters that need to be estimated during the model building phase. The biased calibration methods, principal components regression (PCR) and partial least squares (PLS) are linear methods and so may not as efficiently describe some types of non-linearities, however have comparably very few parameters to be estimated. It is therefore of interest to study the parsimony principle formally in order to understand under what circumstances the various methods are appropriate. In this paper, the mathematical theory of parsimonious data modeling is presented. The assumptions made in the theory are shown to hold for multivariate calibration methods. This theory is used to provide a procedure for selecting the most parsimonious model structure for a given calibration application.
引用
收藏
页码:165 / 177
页数:13
相关论文
共 38 条
[1]
NONLINEAR CALIBRATION USING PROJECTION PURSUIT REGRESSION - APPLICATION TO AN ARRAY OF ION-SELECTIVE ELECTRODES [J].
BEEBE, KR ;
KOWALSKI, BR .
ANALYTICAL CHEMISTRY, 1988, 60 (20) :2273-2278
[2]
Box G.E.P., 1976, TIME SERIES ANAL
[3]
HOW MANY VARIABLES SHOULD BE ENTERED IN A REGRESSION EQUATION [J].
BREIMAN, L ;
FREEDMAN, D .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (381) :131-136
[5]
LOCALLY WEIGHTED REGRESSION - AN APPROACH TO REGRESSION-ANALYSIS BY LOCAL FITTING [J].
CLEVELAND, WS ;
DEVLIN, SJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (403) :596-610
[6]
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[7]
DRAPER NR, 1981, APPLIED REGRESSION A, P337
[8]
A LEISURELY LOOK AT THE BOOTSTRAP, THE JACKKNIFE, AND CROSS-VALIDATION [J].
EFRON, B ;
GONG, G .
AMERICAN STATISTICIAN, 1983, 37 (01) :36-48
[9]
EYKHOFF P, 1974, SYSTEM IDENTIFICATIO, P410
[10]
A NONLINEAR PLS MODEL [J].
FRANK, IE .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1990, 8 (02) :109-119