Learning to match ontologies on the Semantic Web

Cited by: 9
Authors
Doan, A
Madhavan, J
Dhamankar, R
Domingos, P
Halevy, A
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
Keywords
Semantic Web; ontology matching; machine learning; relaxation labeling
DOI
10.1007/s00778-003-0104-2
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
On the Semantic Web, data will inevitably come from many different ontologies, and information processing across ontologies is not possible without knowing the semantic mappings between them. Manually finding such mappings is tedious, error-prone, and clearly not possible on the Web scale. Hence the development of tools to assist in the ontology mapping process is crucial to the success of the Semantic Web. We describe GLUE, a system that employs machine learning techniques to find such mappings. Given two ontologies, for each concept in one ontology GLUE finds the most similar concept in the other ontology. We give well-founded probabilistic definitions to several practical similarity measures and show that GLUE can work with all of them. Another key feature of GLUE is that it uses multiple learning strategies, each of which exploits well a different type of information either in the data instances or in the taxonomic structure of the ontologies. To further improve matching accuracy, we extend GLUE to incorporate commonsense knowledge and domain constraints into the matching process. Our approach is thus distinguished in that it works with a variety of well-defined similarity notions and that it efficiently incorporates multiple types of knowledge. We describe a set of experiments on several real-world domains and show that GLUE proposes highly accurate semantic mappings. Finally, we extend GLUE to find complex mappings between ontologies and describe experiments that show the promise of the approach.
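As an illustration of what a probabilistically defined similarity measure can look like, the sketch below uses the Jaccard coefficient, P(A and B) / P(A or B), and picks the most similar target concept for each source concept. This is a simplified sketch, not the GLUE implementation: it assumes each instance's membership in the concepts of both ontologies is already known, whereas the abstract states that GLUE relies on machine learning for this. The function names `jaccard_similarity` and `best_matches` and the sample concept names are hypothetical.

```python
from typing import Dict, Hashable, Set, Tuple


def jaccard_similarity(instances_a: Set[Hashable], instances_b: Set[Hashable]) -> float:
    """Estimate P(A and B) / P(A or B) from two sets of instance identifiers."""
    union = instances_a | instances_b
    if not union:
        # Two empty concepts share no evidence; treat them as dissimilar.
        return 0.0
    return len(instances_a & instances_b) / len(union)


def best_matches(
    source: Dict[str, Set[Hashable]],
    target: Dict[str, Set[Hashable]],
) -> Dict[str, Tuple[str, float]]:
    """For each source concept, return the most similar target concept and its score."""
    matches = {}
    for name_a, inst_a in source.items():
        best = max(target, key=lambda name_b: jaccard_similarity(inst_a, target[name_b]))
        matches[name_a] = (best, jaccard_similarity(inst_a, target[best]))
    return matches


# Hypothetical toy ontologies: concept name -> set of instance identifiers.
source_ontology = {"Faculty": {1, 2, 3}, "Staff": {4, 5}}
target_ontology = {"Professor": {1, 2}, "Admin": {4, 5, 6}}
result = best_matches(source_ontology, target_ontology)
```

Other similarity measures could be swapped in by replacing `jaccard_similarity` with another function of the same two instance sets.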
Pages: 303-319
Number of pages: 17