WAVELENGTH SELECTION IN MULTICOMPONENT NEAR-INFRARED CALIBRATION

被引:90
作者
BROWN, PJ
机构
[1] Department of Statistics and Computational Mathematics, University of Liverpool, PO Box 147, Liverpool,L69 3BX, United Kingdom
关键词
NIR SPECTROSCOPY; WAVELENGTH SELECTION; INTERACTION EFFECTS; MULTICOMPONENT MIXTURES; PARTIAL LEAST SQUARES; GENERALIZED LEAST SQUARES;
D O I
10.1002/cem.1180060306
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern scanning (near-)infrared reflectance/absorption (NIR) spectroscopes measure the absorptions or reflectances at a sequence of around 1000 wavelengths. Training data may consist of 10-100 carefully designed sample mixtures for which the true composition of the mixture is either known by formulation or accurately determined by wet chemistry. In future one wishes to predict the true composition from the spectrum. In this paper we compare a simple wavelength selection approach with methods which retain all the wavelengths. It offers a powerful yet simple technique for choosing those wavelengths that are specific to each pure component as against the other components (including the medium) for the varying compositions. In the presence of a defined range of ingredients it thus chooses wavelengths which are highly selective for each particular component. It has the added advantage of selecting wavelengths which are little effected by interaction effects and consequent non-linearities. The calibration data used consist of 125 observations of three sugars, each varying at five levels in a full 5(3) design. The validation set consists of 21 further samples specially selected to have compositions outside the range of the training sample. The selection methods perform much better on this prediction set than methods which retain all the wavelengths, 700 in this case. The leave-one-out cross-validation internal to the calibration data would point to the opposite finding and suggests that such cross-validations may be overly flattering to techniques such as partial least squares and may encourage overfitting. After selection, simple straightforward least squares methods may be used, eschewing the need for 'shrinkage' methods such as partial least squares or ridge regression.
引用
收藏
页码:151 / 161
页数:11
相关论文
共 19 条
  • [1] Becker RA., 1988, WADSWORTH BROOKSCOLE, DOI DOI 10.1201/9781351074988
  • [2] CHEMOMETRICS AND SPECTRAL FREQUENCY SELECTION
    BROWN, PJ
    SPIEGELMAN, CH
    DENHAM, MC
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1991, 337 (1647): : 311 - 322
  • [3] BROWN PJ, 1989, BIOMETRIKA, V76, P349
  • [4] BROWN PJ, 1982, J R STAT SOC B, V44, P287
  • [5] BROWN PJ, 1987, J ROY STAT SOC B MET, V49, P46
  • [6] BROWN PJ, 1990, ANAL P, V27, P303
  • [7] BROWN PJ, 1992, CMASR392 AUSTR NAT U
  • [8] BROWN PJ, 1992, BAYESIAN STATISTICS, V4
  • [9] BROWN PJ, 1990, STATISTICAL ANAL MEA, P139
  • [10] THE EFFECT OF IGNORING SMALL MEASUREMENT ERRORS IN PRECISION INSTRUMENT CALIBRATION
    CARROLL, RJ
    SPIEGELMAN, CH
    [J]. JOURNAL OF QUALITY TECHNOLOGY, 1986, 18 (03) : 170 - 173