Time-aware recommender systems: a comprehensive survey and analysis of existing evaluation protocols

被引:328
作者
Campos, Pedro G. [1 ,2 ]
Diez, Fernando [2 ]
Cantador, Ivan [2 ]
机构
[1] Univ Bio Bio, Dept Informat Syst, Concepcion, Chile
[2] Univ Autonoma Madrid, Dept Comp Sci, Madrid, Spain
关键词
Time-aware recommender systems; Context-aware recommender systems; Evaluation methodologies; Survey; CONTEXTUAL INFORMATION; ACCURACY; MODEL;
D O I
10.1007/s11257-012-9136-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Exploiting temporal context has been proved to be an effective approach to improve recommendation performance, as shown, e.g. in the Netflix Prize competition. Time-aware recommender systems (TARS) are indeed receiving increasing attention. A wide range of approaches dealing with the time dimension in user modeling and recommendation strategies have been proposed. In the literature, however, reported results and conclusions about how to incorporate and exploit time information within the recommendation processes seem to be contradictory in some cases. Aiming to clarify and address existing discrepancies, in this paper we present a comprehensive survey and analysis of the state of the art on TARS. The analysis show that meaningful divergences appear in the evaluation protocols used-metrics and methodologies. We identify a number of key conditions on offline evaluation of TARS, and based on these conditions, we provide a comprehensive classification of evaluation protocols for TARS. Moreover, we propose a methodological description framework aimed to make the evaluation process fair and reproducible. We also present an empirical study on the impact of different evaluation protocols on measuring relative performances of well-known TARS. The results obtained show that different uses of the above evaluation conditions yield to remarkably distinct performance and relative ranking values of the recommendation approaches. They reveal the need of clearly stating the evaluation conditions used to ensure comparability and reproducibility of reported results. From our analysis and experiments, we finally conclude with methodological issues a robust evaluation of TARS should take into consideration. Furthermore we provide a number of general guidelines to select proper conditions for evaluating particular TARS.
引用
收藏
页码:67 / 119
页数:53
相关论文
共 100 条
[1]
Incorporating contextual information in recommender systems using a multidimensional approach [J].
Adomavicius, G ;
Sankaranarayanan, R ;
Sen, S ;
Tuzhilin, A .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2005, 23 (01) :103-145
[2]
Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions [J].
Adomavicius, G ;
Tuzhilin, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) :734-749
[3]
Adomavicius G., 2001, Electronic Commerce, V2232, P180
[4]
[Anonymous], 2005, P 14 INT C WORLD WID, DOI DOI 10.1145/1060745.1060754
[5]
[Anonymous], 2003, Proceedings of CHI 2003: Human Factorsin Computing Systems
[6]
[Anonymous], 2006, Proceedings of the 17th Australasian Database Conference-Volume
[7]
[Anonymous], 2010, P 16 ACM SIGKDD INT
[8]
[Anonymous], 2011, Proceedings of the fifth ACM conference on Recommender systems, DOI [10.1145/2043932.2043951, DOI 10.1145/2043932.2043951]
[9]
[Anonymous], 2009, Proceedings of the Third ACM Conference on Recommender Systems, RecSys'09, DOI DOI 10.1145/1639714.1639764
[10]
Ardissono L., 2004, USER MODELING RECOMM