Accurate and diverse recommendations via eliminating redundant correlations

被引:101
作者
Zhou, Tao [1 ,2 ,3 ]
Su, Ri-Qi [1 ,2 ]
Liu, Run-Ran [1 ,2 ]
Jiang, Luo-Luo [1 ,2 ]
Wang, Bing-Hong [1 ,2 ,4 ]
Zhang, Yi-Cheng [3 ,4 ]
机构
[1] Univ Sci & Technol China, Dept Modern Phys, Hefei 230026, Anhui, Peoples R China
[2] Univ Sci & Technol China, Ctr Nonlinear Sci, Hefei 230026, Anhui, Peoples R China
[3] Univ Fribourg, Dept Phys, CH-1700 Fribourg, Switzerland
[4] Shanghai Univ Sci & Technol, Res Ctr Complex Syst Sci, Shanghai 200093, Peoples R China
来源
NEW JOURNAL OF PHYSICS | 2009年 / 11卷
基金
瑞士国家科学基金会; 中国国家自然科学基金;
关键词
SYSTEMS;
D O I
10.1088/1367-2630/11/12/123008
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In this paper, based on a weighted projection of a bipartite user-object network, we introduce a personalized recommendation algorithm, called network-based inference (NBI), which has higher accuracy than the classical algorithm, namely collaborative filtering. In NBI, the correlation resulting from a specific attribute may be repeatedly counted in the cumulative recommendations from different objects. By considering the higher order correlations, we design an improved algorithm that can, to some extent, eliminate the redundant correlations. We test our algorithm on two benchmark data sets, MovieLens and Netflix. Compared with NBI, the algorithmic accuracy, measured by the ranking score, can be further improved by 23% for MovieLens and 22% for Netflix. The present algorithm can even outperform the Latent Dirichlet Allocation algorithm, which requires much longer computational time. Furthermore, most previous studies considered the algorithmic accuracy only; in this paper, we argue that the diversity and popularity, as two significant criteria of algorithmic performance, should also be taken into account. With more or less the same accuracy, an algorithm giving higher diversity and lower popularity is more favorable. Numerical results show that the present algorithm can outperform the standard one simultaneously in all five adopted metrics: lower ranking score and higher precision for accuracy, larger Hamming distance and lower intra-similarity for diversity, as well as smaller average degree for popularity.
引用
收藏
页数:19
相关论文
共 35 条
[1]   Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions [J].
Adomavicius, G ;
Tuzhilin, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) :734-749
[2]  
Ali Kamal., 2004, ACM SIGKDD INT C KNO, P394, DOI DOI 10.1145/1014052.1014097
[3]  
[Anonymous], 2008, NIPS
[4]  
Bennett J., 2007, P KDD CUP WORKSH, P35
[5]   Adaptive interfaces for ubiquitous web access [J].
Billsus, D ;
Clifford, AB ;
Evans, G ;
Gladish, B ;
Pazzani, M .
COMMUNICATIONS OF THE ACM, 2002, 45 (05) :34-38
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]   The anatomy of a large-scale hypertextual Web search engine [J].
Brin, S ;
Page, L .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7) :107-117
[8]   Graph structure in the Web [J].
Broder, A ;
Kumar, R ;
Maghoul, F ;
Raghavan, P ;
Rajagopalan, S ;
Stata, R ;
Tomkins, A ;
Wiener, J .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6) :309-320
[9]   Hybrid recommender systems: Survey and experiments [J].
Burke, R .
USER MODELING AND USER-ADAPTED INTERACTION, 2002, 12 (04) :331-370
[10]   Eigentaste: A constant time collaborative filtering algorithm [J].
Goldberg, K ;
Roeder, T ;
Gupta, D ;
Perkins, C .
INFORMATION RETRIEVAL, 2001, 4 (02) :133-151