Class-modeling techniques, classic and new, for old and new problems

被引:152
作者
Forina, M. [1 ]
Oliveri, P. [1 ]
Lanteri, S. [1 ]
Casale, M. [1 ]
机构
[1] Univ Genoa, Fac Farm, Dipartimento Chim & Tecnol Farmaceut Alimentari, Genoa, Italy
关键词
SIMCA; UNEQ; potential functions; multivariate range modeling;
D O I
10.1016/j.chemolab.2008.05.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Class-modeling techniques, classic and recent, are studied with special reference with the new applications to data sets characterized by many variables, frequently noisy variables without importance in the characterization of the studied class. UNEQ(based on the hypothesis of multivariate normal distribution and on the Hotelling T(2) statistics), SIMCA (with a model built on the class principal components), POTFUN (Potential Functions Modeling, where the probability distribution is estimated by means of the potential functions), MRM (Multivariate Range Modeling, where the model is obtained with the range of the original variables and of discriminant functions) are compared by means of the sensitivities and specificities of the models evaluated both by means of cross validation and with the model forced to accept all the objects of the modeled category. The parameters used to evaluate the performance of class-modeling techniques are critically reviewed. The performances of class-modeling techniques, both in classification and in modeling, have been evaluated on real data sets, with the original variables and on subsets of variables obtained after elimination of non-discriminant variables. The effect of noisy variables and of deviation from the underlying hypotheses are discussed. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:132 / 148
页数:17
相关论文
共 24 条
[1]   COMPARATIVE-ANALYSIS OF STATISTICAL PATTERN-RECOGNITION METHODS IN HIGH-DIMENSIONAL SETTINGS [J].
AEBERHARD, S ;
COOMANS, D ;
DEVEL, O .
PATTERN RECOGNITION, 1994, 27 (08) :1065-1077
[2]  
Coomans D., 1982, THESIS VRIJE U BRUSS
[3]  
Coomans D., 1986, POTENTIAL PATTERN RE
[4]   UNEQ - A DISJOINT MODELING TECHNIQUE FOR PATTERN-RECOGNITION BASED ON NORMAL-DISTRIBUTION [J].
DERDE, MP ;
MASSART, DL .
ANALYTICA CHIMICA ACTA, 1986, 184 :33-51
[5]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[6]  
FORINA M, 1986, VITIS, V25, P189
[7]   Selection of useful predictors in multivariate calibration [J].
Forina, M ;
Lanteri, S ;
Oliveros, MCC ;
Millan, CP .
ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2004, 380 (03) :397-418
[8]   A CLASS-MODELING TECHNIQUE BASED ON POTENTIAL FUNCTIONS [J].
FORINA, M ;
ARMANINO, C ;
LEARDI, R ;
DRAVA, G .
JOURNAL OF CHEMOMETRICS, 1991, 5 (05) :435-453
[9]  
FORINA M, 1982, ANN CHIM-ROME, V72, P143
[10]  
FORINA M, 2005, P C 30 MRE AIX PROV, P82