Measuring the coverage and redundancy of information search services on e-commerce platforms

被引:16
作者
Ma, Baojun [1 ]
Wei, Qiang [1 ]
机构
[1] Tsinghua Univ, Sch Econ & Management, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Information coverage; Information redundancy; Information search; Information structure; RECOMMENDER SYSTEMS; ONLINE; WEB; ALGORITHMS; TOPICALITY; CONSUMERS; REVIEWS; NOVELTY; MATTER; MARKET;
D O I
10.1016/j.elerap.2012.09.001
中图分类号
F [经济];
学科分类号
02 ;
摘要
Today's widespread e-commerce applications pose a new challenge to information search services. They must extract a useful small set of search or recommendation results from a larger set that preserves information diversity. This paper proposes a novel metric setting to measure two important aspects of information diversity, information coverage and information redundancy. In addition to content coverage, we consider another important measure of information coverage called structure coverage, and model it using information entropy. This approach can better convey the information coverage of the extracted small set with respect to the original large set. The proposed metrics are effective and have various useful properties, which are demonstrated by theoretical and experimental analysis. We also designed a calculation method that shows good computational efficiency. Finally, we conducted an experiment using real data from online customer reviews to further emphasize the effectiveness of the proposed metrics. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:560 / 569
页数:10
相关论文
共 97 条
  • [1] Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions
    Adomavicius, G
    Tuzhilin, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) : 734 - 749
  • [2] Agichtein E., 2006, Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P3, DOI 10.1145/1148170.1148175
  • [3] Agichtein E., 2006, Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P19, DOI 10.1145/1148170.1148177
  • [4] Agrawal R, 2009, P 2 ACM INT C WEB SE, V09, P5, DOI [DOI 10.1145/1498759.1498766, 10.1145/1498759.1498766]
  • [5] Performance evaluation of density-based clustering methods
    Aliguliyev, Ramiz M.
    [J]. INFORMATION SCIENCES, 2009, 179 (20) : 3583 - 3602
  • [6] Clustering of document collection - A weighting approach
    Aliguliyev, Ramiz M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 7904 - 7916
  • [7] Allan J., 2002, Proceedings of SIGIR 2002. Twenty-Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P307
  • [8] [Anonymous], 1971, The SMART Retrieval System-Experiments in Automatic Document Processing
  • [9] [Anonymous], 2006, P 29 ANN INT ACM SIG, DOI [DOI 10.1145/1148170.1148245, 10.1145/1148170.1148245]
  • [10] [Anonymous], 2008, International Conference on Machine Learning (ICML)