On the application of ROC analysis to predict classification performance under varying class distributions

被引:72
作者
Webb, GI
Ting, KM
机构
[1] Monash Univ, Sch Comp Sci & Software Engn, Clayton, Vic 3800, Australia
[2] Monash Univ, Gippsland Sch Comp & Informat Technol, Clayton, Vic 3842, Australia
关键词
model evaluation; ROC analysis;
D O I
10.1007/s10994-005-4257-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We counsel caution in the application of ROC analysis for prediction of classifier performance under varying class distributions. We argue that it is not reasonable to expect ROC analysis to provide accurate prediction of model performance under varying distributions if the classes contain causally relevant subclasses whose frequencies may vary at different rates or if there are attributes upon which the classes are causally dependent.
引用
收藏
页码:25 / 32
页数:8
相关论文
共 10 条
[1]   Comparing classifiers when the misallocation costs are uncertain [J].
Adams, NM ;
Hand, DJ .
PATTERN RECOGNITION, 1999, 32 (07) :1139-1147
[2]   Learning changing concepts by exploiting the structure of change [J].
Bartlett, PL ;
Ben-David, S ;
Kulkarni, SR .
MACHINE LEARNING, 2000, 41 (02) :153-174
[3]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[4]  
DUDA OR, 2001, PATTERN CLASSIFICATO
[5]   VERY SIMPLE CLASSIFICATION RULES PERFORM WELL ON MOST COMMONLY USED DATASETS [J].
HOLTE, RC .
MACHINE LEARNING, 1993, 11 (01) :63-91
[6]   Robust classification for imprecise environments [J].
Provost, F ;
Fawcett, T .
MACHINE LEARNING, 2001, 42 (03) :203-231
[7]  
Provost F., 1997, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, P43
[8]  
PROVOST F, 1998, P 15 INT C MACH LEAR, P43
[9]   MEASURING THE ACCURACY OF DIAGNOSTIC SYSTEMS [J].
SWETS, JA .
SCIENCE, 1988, 240 (4857) :1285-1293
[10]  
Weinstein MC, 1980, CLIN DECISION ANAL