Multivariate strategies for classification based on NIR-spectra -: with application to mayonnaise

被引:67
作者
Indahl, UG [1 ]
Sahni, NS [1 ]
Kirkhus, B [1 ]
Næs, T [1 ]
机构
[1] MATFORSK, N-1430 As Nlh, Norway
关键词
discriminant analysis; principal components; automatic variable selection; NIR; vegetable oils;
D O I
10.1016/S0169-7439(99)00023-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of the presented study is two-fold. First, we want to emphasize the power of Near Infrared Reflectance (NIR) spectroscopy for discrimination between mayonnaise samples containing different vegetable oils. Secondly, we want to use our data to compare the performances of different classification procedures. The Nm spectra with 351 variables correspond to equally spaced wavelengths in the 1100-2500 nm area. Feature extraction both by automatic wavelength-selection and by projection onto principal components (PCs) is discussed. The discriminant methods considered are linear discriminant analysis (LDA), quadratic discriminant analysis (QDA) and regression with categorical {0,1}-responses. A dataset containing 162 spectra of mayonnaise samples based on six different vegetable oils is analyzed. By LDA with authentic cross-validation (PC-models re-estimated for each cross-validation segment), only one sample was misclassified. Classification by allocating a sample according to the largest fitted value of a Linear regression (Discriminant-Partial least squares (DPLS) or Discriminant-Principal components regression (DPCR)) is demonstrated sub-optimal compared to LDA of the corresponding PLS- or PCR-scores. QDA significantly outperforms LDA for projections of the data onto subspaces of moderate size (scores of 7-9 PCs). Two automatic variable-selection procedures choose 16 and 26 wavelengths (variables), respectively from the spectra. Based on the selected wavelengths, LDA gives considerably better classification than the regression approach. By reporting the performances of several feature extraction techniques in tandem with three of the most common classification methods, we hope that the reader will notice two relevant aspects: (1) By using the DPLS and DPCR (classification by 'dummy' regressions) one is exposed to a significant risk of obtaining sub-optimal classification results; (2) The automatic wavelength selections may give valuable information about what is actually causing a successful discrimination. Such knowledge can, for instance, be used to select the most suited filters for online applications of NIR. Besides, from demonstrating different classification strategies, our study clearly shows that classification methods with NIR spectra can be used to discriminate between mayonnaise samples of different oil types and fatty acid composition. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:19 / 31
页数:13
相关论文
共 35 条
[1]  
[Anonymous], 1990, NEAR INFRARED TECHNO
[2]  
[Anonymous], 1979, Multivariate analysis
[3]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[4]  
BERTRAND D, 1990, J CHEMOMETR, V4, P411
[5]   DISCRIMINANT-ANALYSIS OF VEGETABLE-OILS BY NEAR-INFRARED REFLECTANCE SPECTROSCOPY [J].
BEWIG, KM ;
CLARKE, AD ;
ROBERTS, C ;
UNKLESBAY, N .
JOURNAL OF THE AMERICAN OIL CHEMISTS SOCIETY, 1994, 71 (02) :195-200
[6]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[7]  
BOOT AJ, 1994, J AOAC INT, V77, P1184
[8]   POTENTIAL OF FOURIER-TRANSFORM NEAR-INFRARED SPECTROSCOPY IN STUDIES OF THE DISSOCIATION OF FATTY-ACIDS IN THE LIQUID-PHASE [J].
CZARNECKI, MA ;
LIU, YL ;
OZAKI, Y ;
SUZUKI, M ;
IWAHASHI, M .
APPLIED SPECTROSCOPY, 1993, 47 (12) :2162-2168
[9]   Classification of vegetable oils by FT-IR [J].
Dahlberg, DB ;
Lee, SM ;
Wenger, SJ ;
Vargo, JA .
APPLIED SPECTROSCOPY, 1997, 51 (08) :1118-1124
[10]   CANONICAL CORRELATION-ANALYSIS OF MIDINFRARED AND NEAR-INFRARED OIL SPECTRA [J].
DEVAUX, MF ;
ROBERT, P ;
QANNARI, A ;
SAFAR, M ;
VIGNEAU, E .
APPLIED SPECTROSCOPY, 1993, 47 (07) :1024-1029