The FedLemur project: Federated search in the real world

被引:33
作者
Avrahami, TT [1 ]
Yau, L [1 ]
Si, L [1 ]
Callan, J [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Language Technol Inst, Pittsburgh, PA 15213 USA
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2006年 / 57卷 / 03期
关键词
D O I
10.1002/asi.20283
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated search and distributed information retrieval systems provide a single user interface for searching multiple full-text search engines. They have been an active area of research for more than a decade, but in spite of their success as a research topic, they are still rare in operational environments. This article discusses a prototype federated search system developed for the U.S. government's FedStats Web portal, and the issues addressed in adapting research solutions to this operational environment. A series of experiments explore how well prior research results, parameter settings, and heuristics apply in the FedStats environment. The article concludes with a set of lessons learned from this technology transfer effort, including observations about search engine quality in the "real world."
引用
收藏
页码:347 / 358
页数:12
相关论文
共 26 条
[1]  
[Anonymous], P 18 INT ACM SIGIR C
[2]   Query-based sampling of text databases [J].
Callan, J ;
Connell, M .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2001, 19 (02) :97-130
[3]  
Callan J, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P479, DOI 10.1145/304181.304224
[4]  
CALLAN J, 2000, ADV INFORM RETRIEVAL, P127
[5]  
Conrad J. G., 2002, Proceedings of the Twenty-eighth International Conference on Very Large Data Bases, P71
[6]  
CRASWELL N, 2000, P 5 ACM C DIG LIB SA, P37
[7]  
FRENCH JC, 1998, P 3D ACM INT C DIG L, P283
[8]  
FRENCH JC, 1999, P 22 ANN INT ACM SIG
[9]   A decision-theoretic approach to database selection in networked IR [J].
Fuhr, N .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1999, 17 (03) :229-249
[10]   GlOSS:: Text-source discovery over the Internet [J].
Gravano, L ;
García-Molina, H ;
Tomasic, A .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1999, 24 (02) :229-264