Confidence transformation for combining classifiers

被引:25
作者
Liu, CL
Hao, HW
Sako, H
机构
[1] Hitachi Ltd, Cent Res Lab, Tokyo 1858601, Japan
[2] Univ Sci & Technol Beijing, Dept Comp Sci, Beijing 100083, Peoples R China
关键词
classifier combination; confidence transformation; evidence combination; Gaussian modeling; logistic regression; pattern classification;
D O I
10.1007/s10044-003-0199-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates a number of confidence transformation methods for measurement-level combination of classifiers. Each confidence transformation method is the combination of a scaling function and an activation function. The activation functions correspond to different types of confidences: likelihood (exponential), log-likelihood, sigmoid, and the evidence combination of sigmoid measures. The sigmoid and evidence measures serve as approximates to class probabilities. The scaling functions are derived by Gaussian density modeling, logistic regression with variable inputs, etc. We test the confidence transformation methods in handwritten digit recognition by combining variable sets of classifiers: neural classifiers only, distance classifiers only, strong classifiers, and mixed strong/weak classifiers. The results show that confidence transformation is efficient to improve the combination performance in all the settings. The normalization of class probabilities to unity of sum is shown to be detrimental to the combination performance. Comparing the scaling functions, the Gaussian method and the logistic regression perform well in most cases. Regarding the confidence types, the sigmoid and evidence measures perform well in most cases, and the evidence measure generally outperforms the sigmoid measure. We also show that the confidence transformation methods are highly robust to the validation sample size in parameter estimation.
引用
收藏
页码:2 / 17
页数:16
相关论文
共 49 条
[1]  
[Anonymous], ADV LARGE MARGIN CLA
[2]  
[Anonymous], 1996, PATTERN CLASSIFICATI
[3]  
Atukorale A. S., 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318), P37, DOI 10.1109/ICDAR.1999.791719
[4]  
Barnett J.A., 1981, P 7 INT JOINT C ART, P868
[5]  
Bengio S., 2002, Information Fusion, V3, P267, DOI 10.1016/S1566-2535(02)00089-1
[6]  
BIRDLE JS, 1990, NEUROCOMPUTING ALGOR, P227
[7]  
Bishop C. M., 1996, Neural networks for pattern recognition
[8]   Reliability parameters to improve combination strategies in multi-expert systems [J].
Cordella, LP ;
Foggia, P ;
Sansone, C ;
Tortorella, F ;
Vento, M .
PATTERN ANALYSIS AND APPLICATIONS, 1999, 2 (03) :205-214
[9]  
Denker J. S., 1991, ADV NEURAL INFORM PR, DOI DOI 10.5555/2986766.2986882
[10]  
Duin RPW, 2002, INT C PATT RECOG, P765, DOI 10.1109/ICPR.2002.1048415