Ordered similarity measures taking into account the rank of documents

被引:5
作者
Michel, C [1 ]
机构
[1] Univ Bordeaux 3, MSHA, GRESIC, Lab CEM, F-33607 Pessac, France
关键词
metrics; similarity measure; rank; evaluation; information retrieval;
D O I
10.1016/S0306-4573(00)00040-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Indices of similarity are used to quantify the difference between two sets of documents. Usually, they are based on the number of elements that they have in common. Indeed, they are calculated from the results of the intersections or unions of the compared sets. But many studies show that order of presentation of the documents is an important fact to be taken into account, particularly in the case of system's evaluation, which is not the case as far as usual measures are concerned. in this article, we propose a general method for the construction of measures of similarity taking into account the rank of presentation of the document. We will call them Ordered Similarity measures, i.e,, measures of OS. Then, we present an experimentation of evaluation used to quantify the filtering impact of a system. This protocol is based on a large scale interrogation of the system and on a comparison of answer sets. We present simultaneously the results of comparisons obtained by a classical measure and by an OS measure. Finally we show how to construct OS measures derived from recall and precision. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:603 / 622
页数:20
相关论文
共 29 条
  • [1] BORLUND P, 1998, P 21 ACM SIGIR C RES
  • [2] BOYCE BR, 1994, MEASUREMENT INFORMAT
  • [3] FLUHR C, 1997, P INET 97 7 ANN C IN
  • [4] Jean Tague-Sutcliffe on measuring information
    Fricke, M
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1998, 34 (04) : 385 - 394
  • [5] Harter SP, 1996, J AM SOC INFORM SCI, V47, P37, DOI 10.1002/(SICI)1097-4571(199601)47:1<37::AID-ASI4>3.0.CO
  • [6] 2-3
  • [7] HARTER SP, 1997, ANN REV INFORMATION, V32, P1
  • [8] Improving information retrieval by combining user profile and document segmentation
    LaineCruzel, S
    Lafouge, T
    Lardy, JP
    BenAbdallah, N
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (03) : 305 - 315
  • [9] LOSEE RM, 1990, SCI INFORMATION MEAS
  • [10] MICHEL C, 1999, P RIAO 2000 CONT BAS