An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms

被引:450
作者
Herlocker, J
Konstan, JA
Riedl, J
机构
[1] Oregon State Univ, Dept Comp Sci, Corvallis, OR 97331 USA
[2] Univ Minnesota, Dept Comp Sci & Engn, Grp Lens Res Project, Minneapolis, MN 55455 USA
来源
INFORMATION RETRIEVAL | 2002年 / 5卷 / 04期
关键词
collaborative filtering; information filtering; empirical studies; preference prediction;
D O I
10.1023/A:1020443909834
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Collaborative filtering systems predict a user's interest in new items based on the recommendations of other people with similar interests. Instead of performing content indexing or content analysis, collaborative filtering systems rely entirely on interest ratings from members of a participating community. Since predictions are based on human ratings, collaborative filtering systems have the potential to provide filtering based on complex attributes, such as quality, taste, or aesthetics. Many implementations of collaborative filtering apply some variation of the neighborhood-based prediction algorithm. Many variations of similarity metrics, weighting approaches, combination measures, and rating normalization have appeared in each implementation. For these parameters and others, there is no consensus as to which choice of technique is most appropriate for what situations, nor how significant an effect on accuracy each parameter has. Consequently, every person implementing a collaborative filtering system must make hard design choices with little guidance. This article provides a set of recommendations to guide design of neighborhood-based prediction systems, based on the results of an empirical study. We apply an analysis framework that divides the neighborhood-based prediction approach into three components and then examines variants of the key parameters in each component. The three components identified are similarity computation, neighbor selection, and rating combination.
引用
收藏
页码:287 / 310
页数:24
相关论文
共 20 条
  • [1] AGGARWAL C, 1999, P ACM SIGKDD INT C K
  • [2] [Anonymous], THESIS U MINNESOTA
  • [3] [Anonymous], 1986, NUMERICAL RECIPES C
  • [4] Basu C, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P714
  • [5] BILLSUS D, 1998, P 15 INT C MACH LEAR, P46
  • [6] Breese J. S., 1998, UAI, P43, DOI 10.5555/2074094.2074100
  • [7] CONDLIFF MK, 1999, 1999 SIGIR WORKSH RE
  • [8] DELGADO J, 1999, 1999 SIGIR WORKSH RE
  • [9] Eigentaste: A constant time collaborative filtering algorithm
    Goldberg, K
    Roeder, T
    Gupta, D
    Perkins, C
    [J]. INFORMATION RETRIEVAL, 2001, 4 (02): : 133 - 151
  • [10] Good N, 1999, SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), P439