Theoretical justification of wavelength selection in PLS calibration development of a new algorithm

被引:241
作者
Spiegelman, CH [1 ]
McShane, MJ
Goetz, MJ
Motamedi, M
Yue, QL
Coté, GL
机构
[1] Texas A&M Univ, Dept Stat, College Stn, TX 77845 USA
[2] Texas A&M Univ, Biomed Engn Program, College Stn, TX 77845 USA
[3] Univ Texas, Med Branch, Laser & Spect Program, Ctr Biomed Engn, Galveston, TX 77550 USA
关键词
D O I
10.1021/ac9705733
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The mathematical basis of improved calibration through selection of informative variables for partial least-squares calibration has been identified, A theoretical investigation of calibration slopes indicates that including uninformative wavelengths negatively affect calibrations by producing both large relative bias toward zero and small additive bias away from the origin. These theoretical results are found regardless of the noise distribution in the data. Studies are performed to confirm this result using a previously used selection method compared to a new method, which is designed to perform more appropriately when dealing with data having large outlying points by including estimates of spectral residuals. Three different data sets are tested with varying noise distributions. In the first data set, Gaussian and log-normal noise was added to simulated data which included a single peak. Second, near-infrared spectra of glucose in cell culture media taken with an FT-IR spectrometer were analyzed, Finally, dispersive Raman Stokes spectra of glucose dissolved in water were assessed, In every case considered here, improved prediction is produced through selection, but data with different noise characteristics showed varying degrees of improvement depending on the selection method used. The practical results showed that, indeed, including residuals into ranking criteria improves selection for data with noise distributions resulting in large outliers, It was concluded that careful design of a selection algorithm should include consideration of spectral noise distributions in the input data to increase the likelihood of successful and appropriate selection.
引用
收藏
页码:35 / 44
页数:10
相关论文
共 28 条
[1]   Genetic algorithm-based method for selecting wavelengths and model size for use with partial least-squares regression: Application to near-infrared spectroscopy [J].
Bangalore, AS ;
Shaffer, RE ;
Small, GW ;
Arnold, MA .
ANALYTICAL CHEMISTRY, 1996, 68 (23) :4200-4212
[2]   MATRIX REPRESENTATIONS AND CRITERIA FOR SELECTING ANALYTICAL WAVELENGTHS FOR MULTICOMPONENT SPECTROSCOPIC ANALYSIS [J].
BROWN, CW ;
LYNCH, PF ;
OBREMSKI, RJ ;
LAVERY, DS .
ANALYTICAL CHEMISTRY, 1982, 54 (09) :1472-1479
[3]   CHEMOMETRICS AND SPECTRAL FREQUENCY SELECTION [J].
BROWN, PJ ;
SPIEGELMAN, CH ;
DENHAM, MC .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1991, 337 (1647) :311-322
[4]   WAVELENGTH SELECTION IN MULTICOMPONENT NEAR-INFRARED CALIBRATION [J].
BROWN, PJ .
JOURNAL OF CHEMOMETRICS, 1992, 6 (03) :151-161
[6]   THE EFFECT OF IGNORING SMALL MEASUREMENT ERRORS IN PRECISION INSTRUMENT CALIBRATION [J].
CARROLL, RJ ;
SPIEGELMAN, CH .
JOURNAL OF QUALITY TECHNOLOGY, 1986, 18 (03) :170-173
[7]  
Carroll RJ, 1995, MEASUREMENT ERROR NO, P21
[8]   Elimination of uninformative variables for multivariate calibration [J].
Centner, V ;
Massart, DL ;
deNoord, OE ;
deJong, S ;
Vandeginste, BM ;
Sterna, C .
ANALYTICAL CHEMISTRY, 1996, 68 (21) :3851-3858
[9]  
DENHAM MC, 1991, THESIS U LIVERPOOL
[10]  
Fuller W. A., 2009, Measurement error models