SELECTION OF OPTIMUM TRAINING SETS FOR USE IN PATTERN-RECOGNITION ANALYSIS OF CHEMICAL-DATA

被引:35
作者
CARPENTER, SE [1 ]
SMALL, GW [1 ]
机构
[1] UNIV IOWA,DEPT CHEM,IOWA CITY,IA 52242
关键词
PATTERN RECOGNITION; DISCRIMINANTS; TRAINING SETS;
D O I
10.1016/S0003-2670(00)83002-0
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
An algorithm is described for the automated selection of optimum training sets for use in pattern recognition studies. A direct calculation is implemented that ensures the data space is sampled equitably in the construction of the training sets. Through this approach, the variety of data in the training set can be maximized while keeping the number of patterns to a specified minimum. A large volume of passive Fourier transform infrared (FT-IR) remote sensing data is used to show the utility of the technique. The algorithm is used to select numerous training sets of various sizes. These training sets are used to develop linear discriminants for pattern recognition. The performance of the discriminants developed from each training set is subsequently evaluated based on their ability to classify a large set of patterns not included in the training procedure. The results are also compared with the performance of discriminants developed using numerous randomly selected training sets. The training sets selected with the new algorithm produce pattern recognition results which are markedly superior to those produced by the randomly selected training sets.
引用
收藏
页码:305 / 321
页数:17
相关论文
共 20 条
[1]   CLASSIFICATION INTO 2 MULTIVARIATE NORMAL-DISTRIBUTIONS WITH DIFFERENT COVARIANCE MATRICES [J].
ANDERSON, TW ;
BAHADUR, RR .
ANNALS OF MATHEMATICAL STATISTICS, 1962, 33 (02) :420-&
[2]   HIGH-SPEED ALGORITHM FOR SIMPLEX OPTIMIZATION CALCULATIONS [J].
BRISSEY, GF ;
SPENCER, RB ;
WILKINS, CL .
ANALYTICAL CHEMISTRY, 1979, 51 (13) :2295-2297
[3]   SUPERVISED PATTERN-RECOGNITION - THE IDEAL METHOD [J].
DERDE, MP ;
MASSART, DL .
ANALYTICA CHIMICA ACTA, 1986, 191 :1-16
[4]   Analysis of a complex of statistical variables into principal components [J].
Hotelling, H .
JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1933, 24 :417-441
[5]   PATTERN-RECOGNITION USED TO INVESTIGATE MULTIVARIATE DATA IN ANALYTICAL-CHEMISTRY [J].
JURS, PC .
SCIENCE, 1986, 232 (4755) :1219-1224
[6]   DETERMINATION OF NUMBER OF FACTORS AND EXPERIMENTAL ERROR IN A DATA MATRIX [J].
MALINOWSKI, ER .
ANALYTICAL CHEMISTRY, 1977, 49 (04) :612-617
[7]  
MARTENS H, 1989, MULTIVARIATE CALIBRA, P111
[8]  
MASSART DL, 1988, CHEMOMETRICS TXB, pCH23
[9]   A SIMPLEX-METHOD FOR FUNCTION MINIMIZATION [J].
NELDER, JA ;
MEAD, R .
COMPUTER JOURNAL, 1965, 7 (04) :308-313
[10]   SIMPLEX PATTERN-RECOGNITION [J].
RITTER, GL ;
LOWRY, SR ;
WILKINS, CL ;
ISENHOUR, TL .
ANALYTICAL CHEMISTRY, 1975, 47 (12) :1951-1956