Bias and controversy in evaluation systems

被引:18
作者
Lauw, Hady W. [1 ]
Lim, Ee-Peng [2 ]
Wang, Ke [3 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[2] Singapore Management Univ, Sch Informat Syst, Singapore 178902, Singapore
[3] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
关键词
bias; controversy; evaluation; rating; link analysis; social network mining;
D O I
10.1109/TKDE.2008.77
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evaluation is prevalent in real life. With the advent of Web 2.0, online evaluation has become an important feature in many applications that involve information (e. g., video, photo, and audio) sharing and social networking (e. g., blogging). In these evaluation settings, a set of reviewers assign scores to a set of objects. As part of the evaluation analysis, we want to obtain fair reviews for all the given objects. However, the reality is that reviewers may deviate in their scores assigned to the same object, due to the potential "bias" of reviewers or "controversy"of objects. The statistical approach of averaging deviations to determine bias and controversy assumes that all reviewers and objects should be given equal weight. In this paper, we look beyond this assumption and propose an approach based on the following observations: 1) evaluation is "subjective,"as reviewers and objects have varying bias and controversy, respectively, and 2) bias and controversy are mutually dependent. These observations underlie our proposed reinforcement-based model to determine bias and controversy simultaneously. Our approach also quantifies "evidence,"which reveals the degree of confidence with which bias and controversy have been derived. This model is shown to be effective by experiments on real-life and synthetic data sets.
引用
收藏
页码:1490 / 1504
页数:15
相关论文
共 41 条
[21]  
Golub GH., 2013, Matrix Computations, DOI 10.56021/9781421407944
[22]   Grading leniency is a removable contaminant of student ratings [J].
Greenwald, AG ;
Gillmore, GM .
AMERICAN PSYCHOLOGIST, 1997, 52 (11) :1209-1217
[23]  
Han J., 2006, DATA MINING CONCEPTS
[24]   Topic-sensitive PageRank: A context-sensitive ranking algorithm for Web search [J].
Haveliwala, TH .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (04) :784-796
[25]   Evaluating collaborative filtering recommender systems [J].
Herlocker, JL ;
Konstan, JA ;
Terveen, K ;
Riedl, JT .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (01) :5-53
[26]  
Hettich S., 2006, P 12 ACM SIGKDD INT, P862, DOI DOI 10.1145/1150402.1150521
[27]   Authoritative sources in a hyperlinked environment [J].
Kleinberg, JM .
JOURNAL OF THE ACM, 1999, 46 (05) :604-632
[28]  
Knorr E. M., 1997, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, P219
[29]  
Lauw HW, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P539
[30]   African elephants show high levels of interest in the skulls and ivory of their own species [J].
McComb, Karen ;
Baker, Lucy ;
Moss, Cynthia .
BIOLOGY LETTERS, 2006, 2 (01) :26-28