A comparative evaluation of full-text, concept-based, and context-sensitive search

被引:26
作者
Moskovitch, Robert
Martins, Susana B.
Behiri, Eytan
Weiss, Aviram
Shahar, Yuval
机构
[1] Ben Gurion Univ Negev, Med Informat Res Ctr, Dept Informat Syst Engn, IL-84105 Beer Sheva, Israel
[2] Stanford Univ, Palo Alto, CA 94304 USA
[3] VA Palo Alto Hlth Care Ctr, Palo Alto, CA USA
[4] E&C Med Intelligence Inc, New York, NY USA
关键词
CLINICAL GUIDELINES; RETRIEVAL; INFORMATION; FRAMEWORK; PROJECT;
D O I
10.1197/jamia.M1953
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objectives: Study comparatively (1) concept-based search, using documents pre-indexed by a conceptual hierarchy; (2) context-sensitive search, using structured, labeled documents; and (3) traditional full-text search. Hypotheses were: (1) more contexts lead to better retrieval accuracy; and (2) adding concept-based search to the other searches would improve upon their baseline performances. Design: Use our Vaidurya architecture, for search and retrieval evaluation, of structured documents classified by a conceptual hierarchy, on a clinical guidelines test collection. Measurements: Precision computed at different levels of recall to assess the contribution of the retrieval methods. Comparisons of precisions done with recall set at 0.5, using t-tests. Results: Performance increased monotonically with the number of query context elements. Adding context-sensitive elements, mean improvement was 11.1% at recall 0.5. With three contexts, mean query precision was 42% +/- 17% (95% confidence interval [CI], 31% to 53%); with two contexts, 32% +/- 13% (95% CI, 27% to 38%); and one context, 20% +/- 9% (95% CI, 15% to 24%). Adding context-based queries to full-text queries monotonically improved precision beyond the 0.4 level of recall. Mean improvement was 4.5% at recall 0.5. Adding concept-based search to full-text search improved precision to 19.4% at recall 0.5. Conclusions: The study demonstrated usefulness of concept-based and context-sensitive queries for enhancing the precision of retrieval from a digital library of semi-structured clinical guideline documents. Concept-based searches outperformed free-text queries, especially when baseline precision was low. In general, the more ontological elements used in the query, the greater the resulting precision.
引用
收藏
页码:164 / 174
页数:11
相关论文
共 21 条
[1]  
[Anonymous], P 17 ANN ACM SIGIR C
[2]  
CONCARD J, P SIGIR, V2, P371
[3]  
Hersh W, 2000, J AM MED INFORM ASSN, P344
[4]   A COMPARISON OF RETRIEVAL EFFECTIVENESS FOR 3 METHODS OF INDEXING MEDICAL LITERATURE [J].
HERSH, WR ;
HICKAM, DH .
AMERICAN JOURNAL OF THE MEDICAL SCIENCES, 1992, 303 (05) :292-300
[5]   A COMPARISON OF 2 METHODS FOR INDEXING AND RETRIEVAL FROM A FULL-TEXT MEDICAL DATABASE [J].
HERSH, WR ;
HICKAM, DH .
MEDICAL DECISION MAKING, 1993, 13 (03) :220-226
[6]   SAPHIRE - AN INFORMATION-RETRIEVAL SYSTEM FEATURING CONCEPT MATCHING, AUTOMATIC-INDEXING, PROBABILISTIC RETRIEVAL, AND HIERARCHICAL RELATIONSHIPS [J].
HERSH, WR ;
GREENES, RA .
COMPUTERS AND BIOMEDICAL RESEARCH, 1990, 23 (05) :410-425
[7]  
HUMPHREYS BL, 1993, B MED LIBR ASSOC, V81, P170
[8]  
MOSKOVITCH R, 2006, IN PRESS ART INTEL M
[9]  
MOSKOVITCH R, 2005, THESIS B GURION U
[10]  
MOSKOVITCH R, 2004, P MEDINFO 04 SAN FRA