Stability of Recommendation Algorithms

被引:49
作者
Adomavicius, Gediminas [1 ]
Zhang, Jingjing [2 ]
机构
[1] Univ Minnesota, Carlson Sch Management, Dept Informat & Decis Sci, Minneapolis, MN 55455 USA
[2] Indiana Univ, Kelley Sch Business, Dept Operat & Decis Technol, Bloomington, IN 47405 USA
基金
美国国家科学基金会;
关键词
Algorithms; Measurement; Performance; Reliability; Recommender systems; evaluation of recommender systems; performance measures; recommendation stability; recommendation accuracy; collaborative filtering; SYSTEMS; TRUST; ACCEPTANCE;
D O I
10.1145/2382438.2382442
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The article explores stability as a new measure of recommender systems performance. Stability is defined to measure the extent to which a recommendation algorithm provides predictions that are consistent with each other. Specifically, for a stable algorithm, adding some of the algorithm's own predictions to the algorithm's training data (for example, if these predictions were confirmed as accurate by users) would not invalidate or change the other predictions. While stability is an interesting theoretical property that can provide additional understanding about recommendation algorithms, we believe stability to be a desired practical property for recommender systems designers as well, because unstable recommendations can potentially decrease users' trust in recommender systems and, as a result, reduce users' acceptance of recommendations. In this article, we also provide an extensive empirical evaluation of stability for six popular recommendation algorithms on four real-world datasets. Our results suggest that stability performance of individual recommendation algorithms is consistent across a variety of datasets and settings. In particular, we find that model-based recommendation algorithms consistently demonstrate higher stability than neighborhood-based collaborative filtering techniques. In addition, we perform a comprehensive empirical analysis of many important factors (e.g., the sparsity of original rating data, normalization of input data, the number of new incoming ratings, the distribution of incoming ratings, the distribution of evaluation data, etc.) and report the impact they have on recommendation stability.
引用
收藏
页数:31
相关论文
共 48 条
[41]  
Rigby C., 2011, Internet Retailing webinars. Review: Boosting email marketing revenue with personalised recommendations by Silverpop
[42]  
Sarwar B., 2001, PROC ACM INT WORLD W, P285
[43]  
Sarwar B. M., 1998, ACM 1998 Conference on Computer Supported Cooperative Work. Proceedings. CSCW 98, P345, DOI 10.1145/289444.289509
[44]  
Shani G, 2011, RECOMMENDER SYSTEMS HANDBOOK, P257, DOI 10.1007/978-0-387-85820-3_8
[45]  
Simitses G., 2005, FUNDAMENTALS STRUCTU
[46]   A THEORY OF CROSS-VALIDATION ERROR [J].
TURNEY, P .
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1994, 6 (04) :361-391
[47]   BIAS AND THE QUANTIFICATION OF STABILITY [J].
TURNEY, P .
MACHINE LEARNING, 1995, 20 (1-2) :23-33
[48]   Factors affecting the acceptance of expert advice [J].
Van Swol, LM ;
Sniezek, JA .
BRITISH JOURNAL OF SOCIAL PSYCHOLOGY, 2005, 44 :443-461