A theoretical and experimental analysis of linear combiners for multiple classifier systems

Cited by: 213
Authors
Fumera, G [1]
Roli, F [1]
Affiliations
[1] Univ Cagliari, Dept Elect & Elect Engn, I-09123 Cagliari, Italy
Keywords
multiple classifier systems; linear combiners; classifier fusion; pattern classification
DOI
10.1109/TPAMI.2005.109
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, a theoretical and experimental analysis of linear combiners for multiple classifier systems is presented. Although linear combiners are the most frequently used combining rules, many important issues related to their operation for pattern classification tasks lack a theoretical basis. After a critical review of the framework developed in works by Tumer and Ghosh [30], [31] on which our analysis is based, we focus on the simplest and most widely used implementation of linear combiners, which consists of assigning a nonnegative weight to each individual classifier. Moreover, we consider the ideal performance of this combining rule, i.e., that achievable when the optimal values of the weights are used. We do not consider the problem of weights estimation, which has been addressed in the literature. Our theoretical analysis shows how the performance of linear combiners, in terms of misclassification probability, depends on the performance of individual classifiers, and on the correlation between their outputs. In particular, we evaluate the ideal performance improvement that can be achieved using the weighted average over the simple average combining rule and investigate in what way it depends on the individual classifiers. Experimental results on real data sets show that the behavior of linear combiners agrees with the predictions of our analytical model. Finally, we discuss the contribution to the state of the art and the practical relevance of our theoretical and experimental analysis of linear combiners for multiple classifier systems.
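The combining rule described in the abstract, a nonnegative weight per classifier with the simple average as the equal-weight special case, can be illustrated with a minimal sketch. This is not the authors' code: the function name `linear_combine`, the array layout, and the normalization of the weights to sum to one are assumptions made for illustration, and the weights are supplied by hand rather than estimated or optimized.

```python
import numpy as np


def linear_combine(posteriors, weights=None):
    """Linearly combine the outputs of several classifiers (illustrative sketch).

    posteriors: array-like of shape (n_classifiers, n_samples, n_classes)
        holding each classifier's estimated class posteriors.
    weights: one nonnegative weight per classifier; None gives the simple average.
    Assumption: weights are rescaled to sum to one.
    """
    posteriors = np.asarray(posteriors, dtype=float)
    n_classifiers = posteriors.shape[0]
    if weights is None:
        # Simple average: every classifier gets the same weight.
        weights = np.full(n_classifiers, 1.0 / n_classifiers)
    weights = np.asarray(weights, dtype=float)
    if weights.shape != (n_classifiers,):
        raise ValueError("expected one weight per classifier")
    if np.any(weights < 0) or weights.sum() == 0:
        raise ValueError("weights must be nonnegative and not all zero")
    weights = weights / weights.sum()
    # Weighted sum over the classifier axis -> shape (n_samples, n_classes).
    return np.tensordot(weights, posteriors, axes=1)


# Three hypothetical classifiers, two samples, two classes.
outputs = [
    [[0.6, 0.4], [0.3, 0.7]],
    [[0.4, 0.6], [0.2, 0.8]],
    [[0.9, 0.1], [0.6, 0.4]],
]
simple = linear_combine(outputs)
weighted = linear_combine(outputs, weights=[0.1, 0.1, 0.8])
print(simple.argmax(axis=1), weighted.argmax(axis=1))
```

With these made-up numbers the two rules disagree on the second sample, because the weighted average leans heavily on the third classifier; whether such weighting helps in practice depends on the accuracy of the individual classifiers and the correlation between their outputs, which is the question the paper analyzes.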
Pages: 942-956
Number of pages: 15
Related papers
35 in total
[1] Alexandre LA, 2000, INT C PATT RECOG, P495, DOI 10.1109/ICPR.2000.906120
[2] [Anonymous], THESIS PURDUE U
[3] [Anonymous], COMBINING ARTICIAL N
[4] Benediktsson, JA; Sveinsson, JR; Ersoy, OK; Swain, PH. Parallel consensual neural networks [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (01): 54-64
[5] Bishop C. M., 1996, Neural networks for pattern recognition
[6] Breiman L, 1996, MACH LEARN, V24, P49
[7] Breiman, L. Bagging predictors [J]. MACHINE LEARNING, 1996, 24 (02): 123-140
[8] Duda R. O., 2000, PATTERN CLASSIFICATI
[9] Freund, Y; Schapire, RE. A decision-theoretic generalization of on-line learning and an application to boosting [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01): 119-139
[10] Friedman J., 2001, The elements of statistical learning, V1, DOI 10.1007/978-0-387-21606-5