Wavelength selection for multivariate calibration using a genetic algorithm: A novel initialization strategy

被引:64
作者
Goicoechea, HC
Olivieri, AC
机构
[1] Univ Nacl Rosario, Fac Ciencias Bioquim & Farmaceut, Dept Quim Analit, RA-2000 Rosario, Argentina
[2] Univ Nacl Litoral, Fac Bioquim & Ciencias Biol, Catedra Quim Analit 1, RA-3000 Litoral, Santa Fe, Argentina
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2002年 / 42卷 / 05期
关键词
D O I
10.1021/ci0255228
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Genetic algorithms and other procedure-, mimicking natural processes are being increasingly used for variable selection, to improve the predictive ability of partial least-squares multivariate calibration. Two issues are critical for the success of genetic algorithms: initialization (setting the first candidates for solving the problem at hand) and overfitting (the tendency to produce excellent results when training, but poor predictions toward fresh samples). A new procedure is presented for sensor selection problems, involving iterative reinitialization based on a statistical analysis of the included sensors. It is shown to give excellent results without the requirement of preparing independent test data sets. Monte Carlo simulations using a theoretical three-component example illustrate how partial least-squares regression greatly benefits from variable selection when the. analyte of interest is diluted, and how the new initialization method compares with other strategies. The new genetic algorithm was applied to five experimental data sets. The target parameters were the concentrations of diluted analytes in four pharmaceutical mixtures studied by UV-visible spectrophotometry and the octane number in gasolines, analyzed by near-infrared spectroscopy.
引用
收藏
页码:1146 / 1153
页数:8
相关论文
共 30 条
  • [1] [Anonymous], 1989, MULTIVARIATE CALIBRA
  • [2] The successive projections algorithm for variable selection in spectroscopic multicomponent analysis
    Araújo, MCU
    Saldanha, TCB
    Galvao, RKH
    Yoneyama, T
    Chame, HC
    Visani, V
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 57 (02) : 65 - 73
  • [3] *ASTM, 269999 ASTM D
  • [4] Genetic algorithm-based method for selecting wavelengths and model size for use with partial least-squares regression: Application to near-infrared spectroscopy
    Bangalore, AS
    Shaffer, RE
    Small, GW
    Arnold, MA
    [J]. ANALYTICAL CHEMISTRY, 1996, 68 (23) : 4200 - 4212
  • [5] Prediction of gasoline properties with near infrared spectroscopy
    Bohács, G
    Ovádi, Z
    Salgó, A
    [J]. JOURNAL OF NEAR INFRARED SPECTROSCOPY, VOL 6 1998, 1998, : 341 - 348
  • [6] Chung H, 2001, B KOREAN CHEM SOC, V22, P37
  • [7] Simultaneous spectrophotometric-multivariate calibration determination of several components of ophthalmic solutions: phenylephrine, chloramphenicol, antipyrine, methylparaben and thimerosal
    Collado, MS
    Mantovani, VE
    Goicoechea, HC
    Olivieri, AC
    [J]. TALANTA, 2000, 52 (05) : 909 - 920
  • [8] DAMIANI PC, UNPUB J PHARM BIOMED
  • [9] DAVIS L, 1987, GENETIC ALGORITHM SI
  • [10] Genetic algorithm-based wavelength selection for the near-infrared determination of glucose in biological matrixes: Initialization strategies and effects of spectral resolution
    Ding, Q
    Small, GW
    Arnold, MA
    [J]. ANALYTICAL CHEMISTRY, 1998, 70 (21) : 4472 - 4479