STATISTICAL ANALYSIS OF k-NEAREST NEIGHBOR COLLABORATIVE RECOMMENDATION

被引:19
作者
Biau, Gerard [1 ,2 ]
Cadre, Benoit [3 ]
Rouviere, Laurent [4 ]
机构
[1] Univ Paris 06, LSTA, F-75013 Paris, France
[2] Univ Paris 06, LPMA, F-75013 Paris, France
[3] UEB, IRMAR, ENS CACHAN BRETAGNE, CNRS, F-35170 Bruz, France
[4] UEB, IRMAR, CREST ENSAI, F-35172 Bruz, France
关键词
Collaborative recommendation; cosine-type similarity; nearest neighbor estimate; consistency; rate of convergence; SYSTEMS;
D O I
10.1214/09-AOS759
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Collaborative recommendation is an information-filtering technique that attempts to present information items that are likely of interest to an Internet user. Traditionally, collaborative systems deal with situations with two types of variables, users and items. In its most common form, the problem is framed as trying to estimate ratings for items that have not yet been consumed by a user. Despite wide-ranging literature, little is known about the statistical properties of recommendation systems. In fact, no clear probabilistic model even exists which would allow us to precisely describe the mathematical forces driving collaborative filtering. To provide an initial contribution to this, we propose to set out a general sequential stochastic model for collaborative recommendation. We offer an in-depth analysis of the so-called cosine-type nearest neighbor collaborative method, which is one of the most widely used algorithms in collaborative filtering, and analyze its asymptotic performance as the number of users grows. We establish consistency of the procedure under mild assumptions on the model. Rates of convergence and examples are also provided.
引用
收藏
页码:1568 / 1592
页数:25
相关论文
共 16 条
[1]  
Abernethy J, 2009, J MACH LEARN RES, V10, P803
[2]   Incorporating contextual information in recommender systems using a multidimensional approach [J].
Adomavicius, G ;
Sankaranarayanan, R ;
Sen, S ;
Tuzhilin, A .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2005, 23 (01) :103-145
[3]   Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions [J].
Adomavicius, G ;
Tuzhilin, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) :734-749
[4]  
Breese J. S., 2013, P 14 C UNC ART INT
[5]  
CANDES E, 2009, MATRIX COMPLET UNPUB
[6]   Exact Matrix Completion via Convex Optimization [J].
Candes, Emmanuel J. ;
Recht, Benjamin .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2009, 9 (06) :717-772
[7]   Personalized recommendation system based on product specification values [J].
Choi, Sang Hyun ;
Kang, Sungmin ;
Jeon, Young Jun .
EXPERT SYSTEMS WITH APPLICATIONS, 2006, 31 (03) :607-616
[8]  
Gyorfi L., 2002, DISTRIBUTION FREE TH
[9]  
Gyorfi L., 2013, A Probabilistic Theory of Pattern Recognition, V31
[10]   Dependency networks for inference, collaborative filtering, and data visualization [J].
Heckerman, D ;
Chickering, DM ;
Meek, C ;
Rounthwaite, R ;
Kadie, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (01) :49-75