Mining supervised classification performance studies: A meta-analytic investigation

被引：28

作者：

Jamain, Adrien ^{[1
]}

Hand, David J. ^{[2
,3
]}

机构：

[1] BNP Paribas, London NW1 6AA, England

[2] Univ London Imperial Coll Sci Technol & Med, Dept Math, London SW7 2AZ, England

[3] Univ London Imperial Coll Sci Technol & Med, Inst Math Sci, London SW7 2AZ, England

来源：

JOURNAL OF CLASSIFICATION | 2008年 / 25卷 / 01期

关键词：

classification rules; supervised classification; neural networks; tree classifiers; logistic regression; nearest neighbor method; Bradley-Terry; meta-analysis; data mining;

D O I：

10.1007/s00357-008-9003-y

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

There have been many comparative studies of classification methods in which real datasets are used as a gauge to assess the relative performance of the methods. Since these comparisons often yield inconclusive or limited results on how methods perform, it is often believed that a broader approach combining these studies would shed some light on this difficult question. This paper describes such an attempt: we have sampled the available literature and created a dataset of 5807 classification results. We show that one of the possible ways to analyze the resulting data is an overall assessment of the classification methods, and we present methods for that particular aim. The merits and demerits of such an approach are discussed, and conclusions are drawn which may assist future research: we argue that the current state of the literature hardly allows large-scale investigations.

引用

页码：87 / 112

页数：26

共 36 条

[1]

[Anonymous], 1995, COMPREHENSIVE CASE S

[2]

BATCHELOR BG, 1976, P 3 INT JOINT C PATT

[3] Constructive training of probabilistic neural networks [J].