A semantic similarity method based on information content exploiting multiple ontologies

Cited by: 68
Authors
Sanchez, David [1]
Batet, Montserrat [1]
Affiliations
[1] Univ Rovira & Virgili, Dept Engn Informat & Matemat, Tarragona 43007, Spain
Keywords
Information content; Semantic similarity; Ontologies; MeSH; SNOMED CT; TAXONOMY; DOMAIN; TEXT;
DOI
10.1016/j.eswa.2012.08.049
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The quantification of the semantic similarity between terms is an important research area that provides a valuable tool for text understanding. Among the different paradigms used by related works to compute semantic similarity, in recent years, information theoretic approaches have shown promising results by computing the information content (IC) of concepts from the knowledge provided by ontologies. These approaches, however, are hampered by the coverage offered by the single input ontology. In this paper, we propose extending IC-based similarity measures by considering multiple ontologies in an integrated way. Several strategies are proposed depending on which ontologies the evaluated terms belong to. Our proposal has been evaluated by means of a widely used benchmark of medical terms, using MeSH and SNOMED CT as ontologies. Results show an improvement in the similarity assessment accuracy when multiple ontologies are considered. (C) 2012 Elsevier Ltd. All rights reserved.
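
The coverage problem named in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two toy taxonomies are hypothetical (they are not MeSH or SNOMED CT), IC is computed intrinsically from hyponym counts in the style of Seco et al., similarity is Resnik's measure (the IC of the most informative common subsumer), and the multi-ontology step simply takes the best score among the ontologies that contain both terms. The strategies actually proposed in the paper may differ.

```python
import math

# Toy taxonomies as child -> parent maps (hypothetical data, not MeSH or SNOMED CT).
ONTOLOGY_A = {
    "disease": None,
    "heart disease": "disease",
    "heart failure": "heart disease",
    "angina": "heart disease",
    "infection": "disease",
}
ONTOLOGY_B = {
    "disorder": None,
    "cardiovascular disorder": "disorder",
    "heart failure": "cardiovascular disorder",
    "angina": "cardiovascular disorder",
    "hypertension": "cardiovascular disorder",
    "renal disorder": "disorder",
    "kidney failure": "renal disorder",
}


def ancestors(onto, concept):
    """Return the concept followed by all of its subsumers up to the root."""
    chain = []
    while concept is not None:
        chain.append(concept)
        concept = onto[concept]
    return chain


def intrinsic_ic(onto, concept):
    """Seco-style intrinsic IC: 1 - log(hyponyms + 1) / log(total concepts)."""
    hyponyms = sum(
        1 for c in onto if c != concept and concept in ancestors(onto, c)
    )
    return 1.0 - math.log(hyponyms + 1) / math.log(len(onto))


def resnik(onto, a, b):
    """IC of the most informative common subsumer; None if a term is missing."""
    if a not in onto or b not in onto:
        return None
    common = set(ancestors(onto, a)) & set(ancestors(onto, b))
    return max(intrinsic_ic(onto, c) for c in common)


def multi_ontology_similarity(ontologies, a, b):
    """Assumed strategy: best score among ontologies that contain both terms."""
    scores = [s for s in (resnik(o, a, b) for o in ontologies) if s is not None]
    return max(scores) if scores else None


if __name__ == "__main__":
    pair = ("heart failure", "hypertension")
    print(resnik(ONTOLOGY_A, *pair))  # None: ONTOLOGY_A lacks "hypertension"
    print(multi_ontology_similarity([ONTOLOGY_A, ONTOLOGY_B], *pair))
```

With a single ontology the pair cannot be scored at all, whereas adding the second ontology yields a value. The sketch returns None when the two terms never appear together in one ontology; handling that case needs a way to relate concepts across ontologies, which this illustration does not attempt.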
Pages: 1393-1399
Page count: 7