Information retrieval by semantic similarity

被引:124
作者
Hliaoutakis, Angelos [1 ]
Varelas, Giannis
Voutsakis, Epimenidis
Petrakis, Euripides G. M.
Milios, Evangelos
机构
[1] Tech Univ Crete, Piraeus, Greece
[2] Dalhousie Univ, Halifax, NS B3H 3J5, Canada
关键词
document retrieval systems; information retrieval; medical information systems; ontologies; semantic similarity;
D O I
10.4018/jswis.2006070104
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic Similarity relates to computing the similarity between conceptually similar but not necessarily lexically similar terms. Typically, semantic similarity is computed by mapping terms to an ontology and by examining their relationships in that ontology. We investigate approaches to computing the semantic similarity between natural language terms (using WordNet as the underlying reference ontology) and between medical terms (using the MeSH ontology of medical and biomedical terms). The most popular semantic similarity methods are implemented and evaluated using WordNet and MeSH. Building upon semantic similarity, we propose the Semantic Similarity based Retrieval Model (SSRM), a novel information retrieval method capable for discovering similarities between documents containing conceptually similar terms. The most effective semantic similarity method is implemented into SSRM. SSRM has been applied in retrieval on OHSUMED (a standard TREC collection available on the Web). The experimental results demonstrated promising performance improvements over classic information retrieval methods utilizing plain lexical matching (e.g., Vector Space Model) and also over state-of-the-art semantic similariiy retrieval methods utilizing ontologies.
引用
收藏
页码:55 / 73
页数:19
相关论文
共 40 条
[1]  
[Anonymous], 1999, Modern Information Retrieval
[2]  
ARASU A, 2002, ACM T INTERNET TECHN, V1, P2
[3]  
Aslandogan Y. A., 2000, Proceedings ACM Multimedia 2000, P313, DOI 10.1145/354384.354514
[4]   LOCAL FEEDBACK IN FULL-TEXT RETRIEVAL SYSTEMS [J].
ATTAR, R ;
FRAENKEL, AS .
JOURNAL OF THE ACM, 1977, 24 (03) :397-417
[5]  
Collins-Thompson Kevyn, 2005, PROC, P704
[6]  
Heng Tao Shen, 2000, Proceedings ACM Multimedia 2000, P39, DOI 10.1145/354384.376098
[7]  
Hersh W., 1994, P 17 ANN INT ACMSIGI, P192, DOI DOI 10.1007/978-1-4471-2099-5_20
[8]  
Hliaoutakis A, 2006, LECT NOTES COMPUT SC, V4172, P512
[9]  
Jiang J., 1998, P INT C RES COMPUTAT
[10]   Image retrieval from the World Wide Web: Issues, techniques, and systems [J].
Kherfi, ML ;
Ziou, D ;
Bernardi, A .
ACM COMPUTING SURVEYS, 2004, 36 (01) :35-67