Theoretical justification of wavelength selection in PLS calibration development of a new algorithm

被引:241
作者
Spiegelman, CH [1 ]
McShane, MJ
Goetz, MJ
Motamedi, M
Yue, QL
Coté, GL
机构
[1] Texas A&M Univ, Dept Stat, College Stn, TX 77845 USA
[2] Texas A&M Univ, Biomed Engn Program, College Stn, TX 77845 USA
[3] Univ Texas, Med Branch, Laser & Spect Program, Ctr Biomed Engn, Galveston, TX 77550 USA
关键词
D O I
10.1021/ac9705733
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The mathematical basis of improved calibration through selection of informative variables for partial least-squares calibration has been identified, A theoretical investigation of calibration slopes indicates that including uninformative wavelengths negatively affect calibrations by producing both large relative bias toward zero and small additive bias away from the origin. These theoretical results are found regardless of the noise distribution in the data. Studies are performed to confirm this result using a previously used selection method compared to a new method, which is designed to perform more appropriately when dealing with data having large outlying points by including estimates of spectral residuals. Three different data sets are tested with varying noise distributions. In the first data set, Gaussian and log-normal noise was added to simulated data which included a single peak. Second, near-infrared spectra of glucose in cell culture media taken with an FT-IR spectrometer were analyzed, Finally, dispersive Raman Stokes spectra of glucose dissolved in water were assessed, In every case considered here, improved prediction is produced through selection, but data with different noise characteristics showed varying degrees of improvement depending on the selection method used. The practical results showed that, indeed, including residuals into ranking criteria improves selection for data with noise distributions resulting in large outliers, It was concluded that careful design of a selection algorithm should include consideration of spectral noise distributions in the input data to increase the likelihood of successful and appropriate selection.
引用
收藏
页码:35 / 44
页数:10
相关论文
共 28 条
[21]   GENETIC ALGORITHMS IN WAVELENGTH SELECTION - A COMPARATIVE-STUDY [J].
LUCASIUS, CB ;
BECKERS, MLM ;
KATEMAN, G .
ANALYTICA CHIMICA ACTA, 1994, 286 (02) :135-153
[22]   GENETIC ALGORITHMS FOR LARGE-SCALE OPTIMIZATION IN CHEMOMETRICS - AN APPLICATION [J].
LUCASIUS, CB ;
KATEMAN, G .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 1991, 10 (08) :254-261
[23]  
Martens H, 1989, MULTIVARIATE CALIBRA, P314
[24]   Variable selection in multivariate calibration of a spectroscopic glucose sensor [J].
McShane, MJ ;
Cote, GL ;
Spiegelman, C .
APPLIED SPECTROSCOPY, 1997, 51 (10) :1559-1564
[25]   SELECTION OF CALIBRATION MIXTURES AND WAVELENGTHS FOR DIFFERENT MULTIVARIATE CALIBRATION METHODS [J].
NAVARROVILLOSLADA, F ;
PEREZARRIBAS, LV ;
LEONGONZALEZ, ME ;
POLODIEZ, LM .
ANALYTICA CHIMICA ACTA, 1995, 313 (1-2) :93-101
[26]   EFFECTS OF WAVELENGTH RANGE ON THE SIMULTANEOUS QUANTITATION OF POLYNUCLEAR AROMATIC-HYDROCARBONS WITH ABSORPTION-SPECTRA [J].
ROSSI, DT ;
PARDUE, HL .
ANALYTICA CHIMICA ACTA, 1985, 175 (SEP) :153-161
[27]   OPTIMAL WAVELENGTH SELECTION FOR QUANTITATIVE-ANALYSIS [J].
SASAKI, K ;
KAWATA, S ;
MINAMI, S .
APPLIED SPECTROSCOPY, 1986, 40 (02) :185-190
[28]   OPTIMAL SELECTION OF WAVELENGTHS IN SPECTROPHOTOMETRIC MULTI COMPONENT ANALYSIS USING RECURSIVE LEAST-SQUARES [J].
THIJSSEN, PC ;
VOGELS, LJP ;
SMIT, HC ;
KATEMAN, G .
FRESENIUS ZEITSCHRIFT FUR ANALYTISCHE CHEMIE, 1985, 320 (06) :531-540