Selecting the optimum number of partial least squares components for the calibration of attenuated total reflectance-mid-infrared spectra of undesigned kerosene samples

被引:45
作者
Gomez-Carracedo, M. P.
Andrade, J. M.
Rutledge, D. N.
Faber, N. M.
机构
[1] Univ A Coruna, Dept Analyt Chem, La Coruna 15071, Spain
[2] Inst Natl Agron Paris Grignon, Chim Analyt Lab, F-75005 Paris, France
[3] Chemometry Consultancy, NL-6717 VD Ede, Netherlands
关键词
partial least squares dimensionality; cross-validation; smoothed partial least squares; randomization test; kerosene; infrared spectrometry;
D O I
10.1016/j.aca.2006.12.036
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Selecting the correct dimensionality is critical for obtaining partial least squares (PLS) regression models with good predictive ability. Although calibration and validation sets are best established using experimental designs, industrial laboratories cannot afford such an approach. Typically, samples are collected in an (formally) undesigned way, spread overtime and their measurements are included in routine measurement processes. This makes it hard to evaluate PLS model dimensionality. In this paper, classical criteria (leave-one-out cross-validation and adjusted Wold's criterion) are compared to recently proposed alternatives (smoothed PLS-PoLiSh and a randomization test) to seek out the optimum dimensionality of PLS models. Kerosene (jet fuel) samples were measured by attenuated total reflectance-mid-IR spectrometry and their spectra where used to predict eight important properties determined using reference methods that are time-consuming and prone to analytical errors. The alternative methods were shown to give reliable dimensionality predictions when compared to external validation. By contrast, the simpler methods seemed to be largely affected by the largest changes in the modeling capabilities of the first components. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:253 / 265
页数:13
相关论文
共 29 条
[1]   Applicability of high-absorbance MIR spectroscopy in industrial quality control of reformed gasolines [J].
Andrade, JM ;
Sánchez, MS ;
Sarabia, LA .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 46 (01) :41-55
[2]   Use of multivariate techniques in quality control of kerosene production [J].
Andrade, JM ;
Muniategui, S ;
LopezMahia, P ;
Prada, D .
FUEL, 1997, 76 (01) :51-59
[3]   Non-destructive and clean prediction of aviation fuel characteristics through Fourier transform-Raman spectroscopy and multivariate calibration [J].
Andrade, JM ;
Garrigues, S ;
de la Guardia, M ;
Gómez-Carracedo, M ;
Prada, D .
ANALYTICA CHIMICA ACTA, 2003, 482 (01) :115-128
[4]  
*ASME, 2005, 1655 ASME
[5]   Comparison of near-infrared and mid-infrared spectroscopy for the determination of distillation property of kerosene [J].
Chung, H ;
Ku, MS ;
Lee, JS .
VIBRATIONAL SPECTROSCOPY, 1999, 20 (02) :155-163
[6]   A strategy for selecting calibration samples for multivariate modelling [J].
Dantas, HA ;
Galvao, RKH ;
Araújo, MCU ;
da Silva, EC ;
Saldanha, TCB ;
José, GE ;
Pasquini, C ;
Raimundo, IM ;
Rohwedder, JJR .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2004, 72 (01) :83-91
[7]   Improving QSAR models for the biological activity of HIV Reverse Transcriptase inhibitors: Aspects of outlier detection and uninformative variable elimination [J].
Daszykowski, M ;
Stanimirova, I ;
Walczak, B ;
Daeyaert, F ;
de Jonge, MR ;
Heeres, J ;
Koymans, LMH ;
Lewi, PJ ;
Vinkers, HM ;
Janssen, PA ;
Massart, DL .
TALANTA, 2005, 68 (01) :54-60
[8]  
Deane J.M., 1989, J CHEMOMETR, V3, P477
[9]   Improved prediction error estimates for multivariate calibration by correcting for the measurement error in the reference values [J].
Faber, K ;
Kowalski, BR .
APPLIED SPECTROSCOPY, 1997, 51 (05) :660-665
[10]   Characterizing the uncertainty in near-infrared spectroscopic prediction of mixed oxygenate concentrations in gasoline: Sample-specific prediction intervals [J].
Faber, NM ;
Duewer, DL ;
Choquette, SJ ;
Green, TL ;
Chesler, SN .
ANALYTICAL CHEMISTRY, 1998, 70 (14) :2972-2982