Stability of Recommendation Algorithms

Cited by: 49
Authors
Adomavicius, Gediminas [1]
Zhang, Jingjing [2]
Affiliations
[1] Univ Minnesota, Carlson Sch Management, Dept Informat & Decis Sci, Minneapolis, MN 55455 USA
[2] Indiana Univ, Kelley Sch Business, Dept Operat & Decis Technol, Bloomington, IN 47405 USA
Funding
National Science Foundation (USA);
Keywords
Algorithms; Measurement; Performance; Reliability; Recommender systems; evaluation of recommender systems; performance measures; recommendation stability; recommendation accuracy; collaborative filtering; SYSTEMS; TRUST; ACCEPTANCE;
DOI
10.1145/2382438.2382442
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The article explores stability as a new measure of recommender systems performance. Stability is defined to measure the extent to which a recommendation algorithm provides predictions that are consistent with each other. Specifically, for a stable algorithm, adding some of the algorithm's own predictions to the algorithm's training data (for example, if these predictions were confirmed as accurate by users) would not invalidate or change the other predictions. While stability is an interesting theoretical property that can provide additional understanding about recommendation algorithms, we believe stability to be a desired practical property for recommender systems designers as well, because unstable recommendations can potentially decrease users' trust in recommender systems and, as a result, reduce users' acceptance of recommendations. In this article, we also provide an extensive empirical evaluation of stability for six popular recommendation algorithms on four real-world datasets. Our results suggest that stability performance of individual recommendation algorithms is consistent across a variety of datasets and settings. In particular, we find that model-based recommendation algorithms consistently demonstrate higher stability than neighborhood-based collaborative filtering techniques. In addition, we perform a comprehensive empirical analysis of many important factors (e.g., the sparsity of original rating data, normalization of input data, the number of new incoming ratings, the distribution of incoming ratings, the distribution of evaluation data, etc.) and report the impact they have on recommendation stability.
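The stability check described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the two toy predictors (`item_mean_predict`, `blend_predict`), the `prediction_shift` helper, the sample ratings, and the choice of mean absolute difference as the shift measure are all hypothetical. The procedure is: predict every unknown rating, add some of those predictions to the training data as if users had confirmed them, re-predict the remaining unknown ratings, and measure how far they moved.

```python
from statistics import mean

def _mean_or(values, default):
    """Average of values, or a fallback when there is nothing to average."""
    return mean(values) if values else default

def item_mean_predict(ratings, user, item, default=3.0):
    """Toy predictor (assumption): the item's average known rating."""
    return _mean_or([r for (u, i), r in ratings.items() if i == item], default)

def blend_predict(ratings, user, item, default=3.0):
    """Toy predictor (assumption): midpoint of the user mean and the item mean."""
    user_avg = _mean_or([r for (u, i), r in ratings.items() if u == user], default)
    item_avg = _mean_or([r for (u, i), r in ratings.items() if i == item], default)
    return (user_avg + item_avg) / 2

def prediction_shift(ratings, unknown_pairs, added_pairs, predict):
    """Mean absolute change in predictions after feeding some of the
    algorithm's own predictions back into its training data."""
    # Step 1: predict every unknown (user, item) pair from the original data.
    before = {pair: predict(ratings, *pair) for pair in unknown_pairs}
    # Step 2: treat a subset of those predictions as newly confirmed ratings.
    augmented = dict(ratings)
    for pair in added_pairs:
        augmented[pair] = before[pair]
    # Step 3: re-predict the pairs that are still unknown and compare.
    added = set(added_pairs)
    remaining = [pair for pair in unknown_pairs if pair not in added]
    return mean(abs(predict(augmented, *pair) - before[pair]) for pair in remaining)

# Hypothetical sample data: (user, item) -> rating on a 1-5 scale.
RATINGS = {("u1", "a"): 5, ("u2", "a"): 3, ("u2", "b"): 1,
           ("u3", "b"): 2, ("u2", "c"): 4}
UNKNOWN = [("u1", "b"), ("u1", "c"), ("u3", "a"), ("u3", "c")]
ADDED = [("u1", "b")]

if __name__ == "__main__":
    for name, fn in [("item mean", item_mean_predict), ("blend", blend_predict)]:
        print(name, prediction_shift(RATINGS, UNKNOWN, ADDED, fn))
```

In this toy setting the item-mean predictor shows no shift, because adding an item's own mean as a new rating leaves that mean unchanged. The blended predictor does shift: the added rating changes the user's average, which moves that user's other predictions. A smaller shift corresponds to a more stable algorithm in the sense the abstract describes.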
Pages: 31