Evaluating combinations of ranked lists and visualizations of inter-document similarity

被引:18
作者
Allan, J [1 ]
Leuski, A [1 ]
Swan, R [1 ]
Byrd, D [1 ]
机构
[1] Univ Massachusetts, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
基金
美国国家科学基金会;
关键词
information retrieval; clustering; document visualization; user study;
D O I
10.1016/S0306-4573(00)00056-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We are interested in how ideas from document clustering can be used to improve the retrieval accuracy of ranked lists in interactive systems. In particular, we are interested in ways to evaluate the effectiveness of such systems to decide how they might best be constructed. In this study, we construct and evaluate systems that present the user with ranked lists and a visualization of inter-document similarities. We first carry out a user study to evaluate the clustering/ranked list combination on instance-oriented retrieval, the task of the TREC-6 Interactive Track. We find that although users generally prefer the combination, they are not able to use it to improve effectiveness. In the second half of this study. we develop and evaluate an approach that more directly combines the ranked list with information from inter-document similarities. Using the TREC collections and relevance judgments, we show that it is possible to realize substantial improvements in effectiveness by doing so, and that although users can use the combined information effectively, the system can provide hints that substantially improve on the user-a solo effort. The resulting approach shares much in common with an interactive application of incremental relevance feedback. Throughout this study, we illustrate our work using two prototype systems constructed for these evaluations. The first, AspInQuery, is a classic information retrieval system augmented with a specialized tool for recording information about instances of relevance. The other system, Lighthouse, is a Web-based application that combines a ranked list with a portrayal of inter-document similarity. Lighthouse can work with collections such as TREC, as well as the results of Web search engines. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:435 / 458
页数:24
相关论文
共 34 条
[1]  
AALBERSBERG IJ, 1992, P 15 ANN INT ACM SIG, P11
[2]   Building hypertext using information retrieval [J].
Allan, J .
INFORMATION PROCESSING & MANAGEMENT, 1997, 33 (02) :145-159
[3]  
Allan J., 1998, Sixth Text REtrieval Conference (TREC-6) (NIST SP 500-240), P169
[4]  
Allan J., 1997, Fifth Text REtrieval Conference (TREC-5) (NIST SP 500-238), P119
[5]  
ALLAN J, 1996, P 19 ANN INT ACM SIG, P270
[6]  
ALLAN J, 1995, THESIS CORNELL U
[7]  
Anick PG, 1997, PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P314, DOI 10.1145/278459.258601
[8]  
[Anonymous], 1996, P 19 ANN INT ACM SIG, DOI DOI 10.1145/243199.243202
[9]   INFORMATION-RETRIEVAL - A SEQUENTIAL LEARNING-PROCESS [J].
BOOKSTEIN, A .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1983, 34 (05) :331-342
[10]  
Borg I., 1987, Multidimensional Similarity Structure Analysis