Mining supervised classification performance studies: A meta-analytic investigation

被引:28
作者
Jamain, Adrien [1 ]
Hand, David J. [2 ,3 ]
机构
[1] BNP Paribas, London NW1 6AA, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Math, London SW7 2AZ, England
[3] Univ London Imperial Coll Sci Technol & Med, Inst Math Sci, London SW7 2AZ, England
关键词
classification rules; supervised classification; neural networks; tree classifiers; logistic regression; nearest neighbor method; Bradley-Terry; meta-analysis; data mining;
D O I
10.1007/s00357-008-9003-y
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
There have been many comparative studies of classification methods in which real datasets are used as a gauge to assess the relative performance of the methods. Since these comparisons often yield inconclusive or limited results on how methods perform, it is often believed that a broader approach combining these studies would shed some light on this difficult question. This paper describes such an attempt: we have sampled the available literature and created a dataset of 5807 classification results. We show that one of the possible ways to analyze the resulting data is an overall assessment of the classification methods, and we present methods for that particular aim. The merits and demerits of such an approach are discussed, and conclusions are drawn which may assist future research: we argue that the current state of the literature hardly allows large-scale investigations.
引用
收藏
页码:87 / 112
页数:26
相关论文
共 36 条
[1]  
[Anonymous], 1995, COMPREHENSIVE CASE S
[2]  
BATCHELOR BG, 1976, P 3 INT JOINT C PATT
[3]   Constructive training of probabilistic neural networks [J].
Berthold, MR ;
Diamond, J .
NEUROCOMPUTING, 1998, 19 (1-3) :167-183
[4]   EVALUATION OF PATTERN CLASSIFIERS FOR FINGERPRINT AND OCR APPLICATIONS [J].
BLUE, JL ;
CANDELA, GT ;
GROTHER, PJ ;
CHELLAPPA, R ;
WILSON, CL .
PATTERN RECOGNITION, 1994, 27 (04) :485-501
[5]  
BRADLEY RA, 1952, BIOMETRIKA, V39, P324, DOI 10.1093/biomet/39.3-4.324
[6]   Ranking learning algorithms: Using IBL and meta-learning on accuracy and time results [J].
Brazdil, PB ;
Soares, C ;
Da Costa, JP .
MACHINE LEARNING, 2003, 50 (03) :251-277
[7]  
Collett D, 1991, MODELLING BINARY DAT
[8]   NEURAL NETWORKS, DECISION TREE INDUCTION AND DISCRIMINANT-ANALYSIS - AN EMPIRICAL-COMPARISON [J].
CURRAM, SP ;
MINGERS, J .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1994, 45 (04) :440-450
[9]   An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization [J].
Dietterich, TG .
MACHINE LEARNING, 2000, 40 (02) :139-157
[10]   A note on comparing classifiers [J].
Duin, RPW .
PATTERN RECOGNITION LETTERS, 1996, 17 (05) :529-536