Classifier evaluation under limited resources

被引:13
作者
Arbela, Reuven
Rokach, Lior [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Informat Syst Engn, IL-84105 Beer Sheva, Israel
[2] Tel Aviv Univ, Dept Ind Engn, IL-69978 Tel Aviv, Israel
关键词
classification; evaluation measures; hit-rate; recall; receiver operating characteristic;
D O I
10.1016/j.patrec.2006.03.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing evaluations measures are insufficient when probabilistic classifiers are used for choosing objects to be included in a limited quota. This paper reviews performance measures that suit probabilistic classification and introduce two novel performance measures that can be used effectively for this task. It then investigates when to use each of the measures and what purpose each one of them serves. The use of these measures is demonstrated on a real life dataset obtained from the human resource field and is validated on set of benchmark datasets. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:1619 / 1631
页数:13
相关论文
共 11 条
[1]  
AN A, 2001, IEEE INT C DAT MIN 2
[2]  
[Anonymous], 2005, Data Mining Pratical Machine Learning Tools and Techniques
[3]  
[Anonymous], 1998, UCI REPOSITORY MACHI
[4]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[5]  
Coppock D. S., 2002, DATA MODELING MINING
[6]  
Kolcz A., 2003, WORKSH LEARN IMB DAT
[7]  
LEE SS, 2000, COMPUT STAT DATA ANA, V34
[8]  
Levin N, 2005, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, P1261, DOI 10.1007/0-387-25465-X_61
[9]   Robust classification for imprecise environments [J].
Provost, F ;
Fawcett, T .
MACHINE LEARNING, 2001, 42 (03) :203-231
[10]  
Provost F.J., 1998, P 15 INT C MACH LEAR, P445