Confidence intervals of the prediction ability and performance scores of classifications methods

被引:9
作者
Forina, M [1 ]
Lanteri, S [1 ]
Rosso, S [1 ]
机构
[1] Univ Genoa, Dept Chem & Technol Drugs & Foods, I-16147 Genoa, Italy
关键词
classification; class modeling; confidence intervals; prediction rate; classification performance score;
D O I
10.1016/S0169-7439(01)00129-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Chemometricians widely use multivariate classification and class-modeling techniques in identity problems, in multivariate quality control, to obtain homogeneous set for multivariate calibration. The performance of such techniques is measured by parameters related to the prediction rate, the percentage of correct classifications of the objects in the evaluation set, i.e., the objects not used to develop the classification model. Sometimes, the prediction rate is discussed with reference to the no-model rate. In this paper, we suggest to compute a confidence interval for the prediction rate and the derived quantities, as people usually do in the case of measured chemical quantities. The statistical bases of this confidence interval are presented, with an alternative estimate other than those previously known. Tables with the limits of the confidence intervals for some usual cases (two or three categories) are reported. To take into account the effect of the number of samples used to evaluate the prediction ability, a classification performance score is introduced, whose value is zero when the classification rate is compatible with the hypothesis of random assignment. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:121 / 132
页数:12
相关论文
共 18 条
[1]  
[Anonymous], 1983, Food Research and Data Analysis
[2]  
FORINA M, 1986, VITIS, V25, P189
[3]  
FORINA M, QPARVUS RELEASE 3 0
[4]  
FRANK IE, 1994, DATA ANAL HDB, P32
[5]   STATISTICAL ANALYSIS OF MULTIVARIATE CHROMATOGRAPHIC DATA ON NATURAL MIXTURES, WITH PARTICULAR REFERENCE TO PEPPERMINT OILS [J].
HARTMANN, N ;
HAWKES, SJ .
JOURNAL OF CHROMATOGRAPHIC SCIENCE, 1970, 8 (10) :610-&
[6]   COMPUTERIZED LEARNING MACHINES APPLIED TO CHEMICAL PROBLEMS - INVESTIGATION OF CONVERGENCE RATE AND PREDICTIVE ABILITY OF ADAPTIVE BINARY PATTERN CLASSIFIERS [J].
JURS, PC ;
KOWALSKI, BR ;
ISENHOUR, TL ;
REILLEY, CN .
ANALYTICAL CHEMISTRY, 1969, 41 (06) :690-&
[7]   COMPUTERIZED LEARNING MACHINES APPLIED TO CHEMICAL PROBLEMS - MOLECULAR FORMULA DETERMINATION FROM LOW RESOLUTION MASS SPECTROMETRY [J].
JURS, PC ;
KOWALSKI, BR ;
ISENHOUR, TL .
ANALYTICAL CHEMISTRY, 1969, 41 (01) :21-&
[8]   AN INVESTIGATION OF COMBINED PATTERNS FROM DIVERSE ANALYTICAL DATA USING COMPUTERIZED LEARNING MACHINES [J].
JURS, PC ;
KOWALSKI, BR ;
ISENHOUR, TL ;
REILLEY, CN .
ANALYTICAL CHEMISTRY, 1969, 41 (14) :1949-&
[9]  
KOHAVI R, 1995, P IJCAI 95
[10]   COMPUTERIZED LEARNING MACHINES APPLIED TO CHEMICAL PROBLEMS - MULTICATEGORY PATTERN CLASSIFICATION BY LEAST SQUARES [J].
KOWALSKI, BR ;
JURS, PC ;
ISENHOUR, TL ;
REILLEY, CN .
ANALYTICAL CHEMISTRY, 1969, 41 (06) :695-&