THE EFFECTIVENESS OF DOCUMENT NEIGHBORING IN SEARCH ENHANCEMENT

被引:34
作者
WILBUR, WJ
COFFEE, L
机构
[1] National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD
关键词
D O I
10.1016/0306-4573(94)90068-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider two kinds of queries that may be applied to a database. The first is a query written by a searcher to express an information need. The second is a request for documents most similar to a document already judged relevant by the searcher. We examine the effectiveness of these two procedures and show that in important cases the latter query type is more effective than the former. This provides a new view of the cluster hypothesis and a justification for document neighboring procedures (precomputation of closely related documents). If all the documents in a database have readily available precomputed nearest neighbors, a new search algorithm, which we call parallel neighborhood searching, is conveniently used. We show that this feedback-based method provides significant improvement in recall over traditional linear searching methods, and even appears superior to traditional feedback methods in overall performance.
引用
收藏
页码:253 / 266
页数:14
相关论文
共 23 条
[1]  
BORODIN A, 1971, SMART RETRIEVAL SYST, P394
[2]  
Buckley Chris, 1985, 85686 CORN U DEP COM
[3]  
Chang Y, 1971, SMART RETRIEVAL SYST, P355
[4]  
CROFT WB, 1982, 8221 U MASS TECHN RE
[5]  
FOX EA, 1990, VIRGINIA DISC ONE
[6]  
IDE E., 1971, SMART RETRIEVAL SYST, P337
[7]   USE OF HIERARCHIC CLUSTERING IN INFORMATION RETRIEVAL [J].
JARDINE, N ;
VANRIJSB.CJ .
INFORMATION STORAGE AND RETRIEVAL, 1971, 7 (05) :217-&
[8]  
JONES KS, 1972, J DOC, V28, P11, DOI DOI 10.1108/EB026526
[9]  
LESK ME, 1971, SMART RETRIEVAL SYST, P506
[10]  
LUCARELLA D, 1988, J INFORM SCI, V14, P25, DOI 10.1177/016555158801400104