Measuring Semantic Similarity Between Biomedical Concepts Within Multiple Ontologies

被引:62
作者
Al-Mubaid, Hisham [1 ]
Nguyen, Hoa A. [1 ]
机构
[1] Univ Houston Clear Lake City, Houston, TX 77058 USA
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS | 2009年 / 39卷 / 04期
关键词
Biomedical information retrieval; biomedical ontology; biomedical terminology; semantic similarity; Unified Medical Language System (UMLS); WORDNET;
D O I
10.1109/TSMCC.2009.2020689
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the intelligent knowledge-based applications contain components for measuring semantic similarity between terms. Many of the existing semantic similarity measures that use ontology structure as their primary source cannot measure semantic similarity between terms and concepts using multiple ontologies. This research explores a new way to measure semantic similarity between biomedical concepts using multiple ontologies. We propose a new ontology-structure-based technique for measuring semantic similarity in single ontology and across multiple ontologies in the biomedical domain within the framework of Unified Medical Language System (UMLS). The proposed measure is based on three features: 1) cross-modified path length between two concepts; 2) a new feature of common specificity of concepts in the ontology; and 3) local granularity of ontology clusters. The proposed technique was evaluated relative to human similarity scores and compared with other existing measures using two terminologies within UMLS framework: Medical Subject Headings and Systemized Nomenclature of Medicine Clinical Term. The experimental results validate the efficiency of the proposed technique in single and multiple ontologies, and demonstrate that our proposed measure achieves the best results of correlation with human scores in all experiments.
引用
收藏
页码:389 / 398
页数:10
相关论文
共 25 条
[1]  
Al-Mubaid H., 2007, P 22 ACM S APPL COMP
[2]  
Budanitsky A, 2006, COMPUT LINGUIST, V32, P13, DOI 10.1162/coli.2006.32.1.13
[3]   Towards the development of a conceptual distance metric for the UMLS [J].
Caviedes, JE ;
Cimino, JJ .
JOURNAL OF BIOMEDICAL INFORMATICS, 2004, 37 (02) :77-85
[4]  
HLIAOUTAKIS A, 2005, THESIS TU CRETE CHAN
[5]   An approach for organizing knowledge according to terminology and representing it visually [J].
Ishida, K ;
Ohta, T .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2002, 32 (04) :366-373
[6]  
JIANG JJ, 1997, P ROCLING 10 TAIW
[7]  
KLEINSORGE R, 2000, UNIFIED MED LANGUAGE
[8]  
Kuntz H., 2005, SNOMED CT STANDARD T
[9]  
Leacock C, 1998, LANG SPEECH & COMMUN, P265
[10]  
Li YH, 2003, IEEE T KNOWL DATA EN, V15, P871, DOI 10.1109/TKDE.2003.1209005