Collaborative filtering based on iterative principal component analysis

被引:90
作者
Kim, D [1 ]
Yum, BJ [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Ind Engn, Taejon 305701, South Korea
关键词
recommender system; collaborative filtering; principal component analysis; singular value decomposition;
D O I
10.1016/j.eswa.2004.12.037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Collaborative filtering (CF) is one of the most popular recommender system technologies, and utilizes the known preferences of a group of users to predict the unknown preference of a new user. However, the existing CF techniques has the drawback that it requires the entire existing data be maintained and analyzed repeatedly whenever new user ratings are added. To avoid such a problem, Eigentaste, a CF approach based on the principal component analysis (PCA), has been proposed. However, Eigentaste requires that each user rate every item in the so called gauge set for executing PCA, which may not be always feasible in practice. Developed in this article is an iterative PCA approach in which no gauge set is required, and singular value decomposition is employed for estimating missing ratings and dimensionality reduction. Principal component values for users in reduced dimension are used for clustering users. Then, the proposed approach is compared to Eigentaste in terms of the mean absolute error of prediction using the Jester, MovieLens, and EachMovie data sets. Experimental results show that the proposed approach, even without a gauge set, performs slightly better than Eigentaste regardless of the data set and clustering method employed, implying that it can be used as a useful alternative when defining a gauge set is neither possible nor practical. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:823 / 830
页数:8
相关论文
共 10 条
  • [1] Principal component analysis of dissolution data with missing elements
    Adams, E
    Walczak, B
    Vervaet, C
    Risha, PG
    Massart, DL
    [J]. INTERNATIONAL JOURNAL OF PHARMACEUTICS, 2002, 234 (1-2) : 169 - 178
  • [2] Eigentaste: A constant time collaborative filtering algorithm
    Goldberg, K
    Roeder, T
    Gupta, D
    Perkins, C
    [J]. INFORMATION RETRIEVAL, 2001, 4 (02): : 133 - 151
  • [3] Golub G. H., 1996, MATRIX COMPUTATIONS
  • [4] Hart, 2006, PATTERN CLASSIFICATI
  • [5] Johnson R.A., 1988, Applied multivariate statistical analysis
  • [6] MacQueen J., 1967, P 5 BERKELEY S MATH, V1, P281
  • [7] Montgomery D.C., 1999, Design and analysis of experiment
  • [8] Resnick Paul, 1994, P ACM C COMP SUPP CO, P175, DOI DOI 10.1145/192844.192905
  • [9] Sarwar B., 2000, ACM WEBKDD 2000 WEB, V1625, DOI [10.3141/1625-22, DOI 10.3141/1625-22]
  • [10] Walczak B, 2001, CHEMOMETR INTELL LAB, V58, P15, DOI 10.1016/S0169-7439(01)00131-9