Evaluation Measures for Ordinal Regression

被引:147
作者
Baccianella, Stefano [1 ]
Esuli, Andrea [1 ]
Sebastiani, Fabrizio [1 ]
机构
[1] CNR, Ist Sci & Tecnol Informaz, I-56124 Pisa, Italy
来源
2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS | 2009年
关键词
Ordinal regression; Ordinal classification; Evaluation measures; Class imbalance; Product reviews;
D O I
10.1109/ISDA.2009.230
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ordinal regression (OR also known as ordinal classification) has received increasing attention in recent times, due to its importance in IR applications such as learning to rank and product review rating. However, research has not paid attention to the fact that typical applications of OR often involve datasets that are highly imbalanced. An imbalanced dataset has the consequence that, when testing a system with an evaluation measure conceived for balanced datasets, a trivial system assigning all items to a single class (typically, the majority class) may even outperform genuinely engineered systems. Moreover, if this evaluation measure is used for parameter optimization, a parameter choice may result that makes the system behave very much like a trivial system. In order to avoid this, evaluation measures that can handle imbalance must be used. We propose a simple way to turn standard measures for OR into ones robust to imbalance. We also show that, once used on balanced datasets, the two versions of each measure coincide, and therefore argue that our measures should become the standard choice for OR.
引用
收藏
页码:283 / 287
页数:5
相关论文
共 28 条
[1]  
[Anonymous], 2004, ACM Sigkdd Explorations Newsletter
[2]  
[Anonymous], 2004, P 21 INT C MACHINE L
[3]  
[Anonymous], PAC AS C KNOWL DISC
[4]  
[Anonymous], ADV NEURAL INFORM PR
[5]  
Baccianella S, 2009, LECT NOTES COMPUT SC, V5478, P461, DOI 10.1007/978-3-642-00958-7_41
[6]  
Beineke P., 2004, P 12 ANN M ASS COMPU, P263, DOI DOI 10.3115/1218955.1218989
[7]  
Blitzer John., 2007, Annual Meeting-Association For Computational Linguistics, V45, P440
[8]  
Bo P., 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI DOI 10.1561/1500000011
[9]  
Chu W., 2005, P INT C MACH LEARN N, P145
[10]  
Crammer K, 2002, ADV NEUR IN, V14, P641