Wavelength selection and optimization of pattern recognition methods using the genetic algorithm

被引:63
作者
Smith, BM [1 ]
Gemperline, PJ [1 ]
机构
[1] E Carolina Univ, Dept Chem, Coll Arts & Sci, Greenville, NC 27858 USA
关键词
genetic algorithm; Mahalanobis distance method; microcrystalline cellulose;
D O I
10.1016/S0003-2670(00)01114-4
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
A genetic algorithm (GA) method for wavelength selection and optimization of near-infrared (NIR) pattern recognition methods was developed to reduce misclassification errors of similar materials. Our goal was to automate completely the process of producing pattern recognition models, consequently, we felt it was important to include pre-processing options, the number of principal components and wavelength selection in the chromosomes. The SIMCA residual variance analysis and the Mahalanobis distance methods were used to classify samples of three different types of microcrystalline cellulose (Avicel PH101, PH102, and RC581) and sulfamethoxazole (SMX). Without GA optimization, approximately 15% of Avicel PH101 and PH102 test samples were misclassified since their NIR spectra are very similar. The GA was used to optimize pattern recognition performance on training sets using a figure of merit designed to maximize correct classification of acceptable samples and minimize classification of unacceptable samples or samples of dissimilar materials. After GA optimization of pattern recognition parameters, 100% correct classification of a validation set was achieved using both the residual variance analysis and the Mahalanobis distance methods. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:167 / 177
页数:11
相关论文
共 28 条
[1]   Near-IR detection of polymorphism and process-related substances [J].
Aldridge, PK ;
Evans, CL ;
Ward, HW ;
Colgan, ST ;
Boyer, N ;
Gemperline, PJ .
ANALYTICAL CHEMISTRY, 1996, 68 (06) :997-1002
[2]   IDENTIFICATION OF TABLET FORMULATIONS INSIDE BLISTER PACKAGES BY NEAR-INFRARED SPECTROSCOPY [J].
ALDRIDGE, PK ;
MUSHINSKY, RF ;
ANDINO, MM ;
EVANS, CL .
APPLIED SPECTROSCOPY, 1994, 48 (10) :1272-1276
[3]   Genetic-algorithm-based wavelength selection in multicomponent spectrometric determinations by PLS: Application on indomethacin and acemethacin mixture [J].
Arcos, MJ ;
Ortiz, MC ;
Villahoz, B ;
Sarabia, LA .
ANALYTICA CHIMICA ACTA, 1997, 339 (1-2) :63-77
[4]   Genetic algorithm-based method for selecting wavelengths and model size for use with partial least-squares regression: Application to near-infrared spectroscopy [J].
Bangalore, AS ;
Shaffer, RE ;
Small, GW ;
Arnold, MA .
ANALYTICAL CHEMISTRY, 1996, 68 (23) :4200-4212
[5]   CLASSIFIER SYSTEMS AND GENETIC ALGORITHMS [J].
BOOKER, LB ;
GOLDBERG, DE ;
HOLLAND, JH .
ARTIFICIAL INTELLIGENCE, 1989, 40 (1-3) :235-282
[6]   NEAR-INFRARED SPECTROSCOPY AS AN ALTERNATIVE TO BIOLOGICAL TESTING FOR QUALITY-CONTROL OF HYALURONAN - COMPARISON OF DATA PREPROCESSING METHODS FOR CLASSIFICATION [J].
CARLSSON, AE ;
JANNE, KLR .
APPLIED SPECTROSCOPY, 1995, 49 (07) :1037-1040
[7]   USES OF NEAR-INFRARED SPECTROSCOPY IN PHARMACEUTICAL ANALYSIS [J].
CIURCZAK, EW .
APPLIED SPECTROSCOPY REVIEWS, 1987, 23 (1-2) :147-163
[8]   NUMERIC GENETIC ALGORITHM .1. THEORY, ALGORITHM AND SIMULATED EXPERIMENTS [J].
CONG, PS ;
LI, TH .
ANALYTICA CHIMICA ACTA, 1994, 293 (1-2) :191-203
[9]  
CORTI P, 1992, PROCESS CONTR QUAL, V2, P131
[10]  
Davis L., 1987, GENETIC ALGORITHMS S