NONLINEAR METHODS FOR MULTIVARIATE STATISTICAL CALIBRATION AND THEIR USE IN PALEOECOLOGY - A COMPARISON OF INVERSE (K-NEAREST NEIGHBORS, PARTIAL LEAST-SQUARES AND WEIGHTED AVERAGING PARTIAL LEAST-SQUARES) AND CLASSICAL APPROACHES

被引:137
作者
TERBRAAK, CJF [1 ]
机构
[1] IBN, DLO, INST FORESTRY & NAT RES, 6700 AC WAGENINGEN, NETHERLANDS
关键词
D O I
10.1016/0169-7439(95)00002-E
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current environmental problems, such as acid rain and global warming, have greatly increased interest in fossil species assemblages as indicators of the palaeoenvironment and thus in quantitative methods for reconstructing environmental variables from species assemblage data. The ensuing multivariate calibration problem appears to be even harder than that of spectroscopic calibration, primarily because the basic model is unimodal (Shelford's law of tolerance) instead of being linear (Beer's law). The strong non-linearity has led to the use of non-parametric calibration methods, in particular the smooth response surface method (SRS) and the method of best modern analogues, alias k-nearest neighbours (k-NN), and to a form of non-linear partial least squares (PLS), called weighted averaging partial least squares (WA-PLS), specially designed to analyze unimodal data. SRS and k-NN are recognized as non-parametric smoothing versions of the classical and inverse approach to linear calibration, respectively, whereas PLS and WA-PLS are inverse methods that bring in the aspect of dimension reduction. In a comparison on 'realistically looking' simulated compositional data with 100 training samples and 500 independent evaluation samples, WA-PLS and k-NN outperformed PLS when the species response functions were unimodal. For such data, k-NN resisted the curse of dimensionality. However, when the response functions were near-linear, WA-PLS and PLS performed about equally and clearly outperformed k-NN. On other simulated data, simultaneous calibration of two climate variables via a parametric non-linear classical method was compared with individual calibrations via inverse methods. The simultaneous calibration method was better at the border of the sampled space than the best inverse method (WA-PLS) and much better than k-NN. The simulations demonstrated the limitations of the leave-one-out estimate of prediction error: it showed severe method-dependent bias.
引用
收藏
页码:165 / 180
页数:16
相关论文
共 83 条
[1]  
AITCHISON J, 1986, STATISTICAL ANAL COM
[2]  
ANDERSON JA, 1984, J R STAT SOC B, V46, P1
[3]  
ANDERSON NJ, 1993, TRENDS ECOL EVOL, V8, P356
[4]  
[Anonymous], 1977, THEORIES POPULATIONS
[5]  
Bartlein P. J., 1993, GEOLOGICAL SOC AM SP, P275, DOI DOI 10.1130/SPE276-P275
[6]   CLIMATIC RESPONSE SURFACES FROM POLLEN DATA FOR SOME EASTERN NORTH-AMERICAN TAXA [J].
BARTLEIN, PJ ;
PRENTICE, IC ;
WEBB, T .
JOURNAL OF BIOGEOGRAPHY, 1986, 13 (01) :35-57
[7]  
BARTLEIN PJ, 1985, SYLLOGEOUS, V55, P301
[8]   DIATOMS AND PH RECONSTRUCTION [J].
BIRKS, HJB ;
LINE, JM ;
JUGGINS, S ;
STEVENSON, AC ;
TERBRAAK, CJF .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1990, 327 (1240) :263-278
[9]  
Birks HJB, 1985, NUMERICAL METHODS QU
[10]  
Birks HJB, 1981, CLIMATE HIST, P111