Evaluating the effectiveness of content-oriented XML retrieval methods

被引:6
作者
Goevert, Norbert [1 ]
Fuhr, Norbert
Lalmas, Mounia
Kazai, Gabriella
机构
[1] Univ Dortmund, D-44221 Dortmund, Germany
[2] Univ Duisburg Gesamthsch, D-4100 Duisburg, Germany
[3] Univ London Queen Mary & Westfield Coll, London E1 4NS, England
来源
INFORMATION RETRIEVAL | 2006年 / 9卷 / 06期
关键词
XML retrieval; evaluation; effectiveness; metrics; exhaustiveness and specificity;
D O I
10.1007/s10791-006-9008-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Content-oriented XML retrieval approaches aim at a more focused retrieval strategy: Instead of retrieving whole documents, document components that are exhaustive to the information need while at the same time being as specific as possible should be retrieved. In this article, we show that the evaluation methods developed for standard retrieval must be modified in order to deal with the structure of XML documents. More precisely, the size and overlap of document components must be taken into account. For this purpose, we propose a new effectiveness metric based on the definition of a concept space defined upon the notions of exhaustiveness and specificity of a search result. We compare the results of this new metric by the results obtained with the official metric used in INEX, the evaluation initiative for content-oriented XML retrieval.
引用
收藏
页码:699 / 722
页数:24
相关论文
共 30 条
[1]  
[Anonymous], P 13 TEXT RETR C TRE
[2]  
[Anonymous], P 2 C CONC LIB INF S
[3]  
BAEZAYATES R, 2002, P SIGIR 2002 WORKSH
[4]  
BAEZAYATES R, 2000, P SIGIR 2000 WORKSH
[5]  
Beaulieu M, 1996, J AM SOC INFORM SCI, V47, P85, DOI 10.1002/(SICI)1097-4571(199601)47:1<85::AID-ASI8>3.0.CO
[6]  
2-Z
[7]  
CHIARAMELLA Y, 1996, 8134 FERMI ESPRIT BR
[8]  
CLARK J, 1999, XML PATH LANGUGE XPA
[9]  
CLEVERDON CW, 1966, FACTORS DETERMINING, V2
[10]  
Cooper W.S., 1968, J AM SOC INFORM SCI, V19, P30