A methodology to learn ontological attributes from the Web

被引:40
作者
Sanchez, David [1 ]
机构
[1] Univ Rovira & Virgili, Dept Engn Informat & Matemat, Tarragona 43007, Spain
关键词
Ontology learning; Meronyms; Attributes; Features; Web mining; Knowledge acquisition; ACQUISITION;
D O I
10.1016/j.datak.2010.01.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class descriptors such as attributes, features or meronyms are rarely considered when developing ontologies. Even WordNet only includes a reduced amount of part-of relationships. However, these data are crucial for defining concepts such as those considered in classical knowledge representation models. Some attempts have been made to extract those relations from text using general meronymy detection patterns; however, there has been very little work on learning expressive class attributes (including associated domain, range or data values) at an ontological level. In this paper we take this background into consideration when proposing and implementing an automatic, non-supervised and domain-independent methodology to extend ontological classes in terms of learning concept attributes, data-types, value ranges and measurement units. In order to present a general solution and minimize the data sparseness of pattern-based approaches, we use the Web as a massive learning corpus to retrieve data and to infer information distribution using highly contextualized queries aimed at improving the quality of the result. This corpus is also automatically updated in an adaptive manner according to the knowledge already acquired and the learning throughput. Results have been manually checked by means of an expert-based concept-per-concept evaluation for several well distinguished domains showing reliable results and a reasonable learning performance. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:573 / 597
页数:25
相关论文
共 73 条
  • [1] Alfonseca E., 2002, P 1 INT C GEN WORDNE
  • [2] ALMUHAREB A, 2006, P EUR C ART INT, P543
  • [3] Almuhareb A., 2004, Procs. of EMNLP, P158
  • [4] An YJ, 2007, APPLIED COMPUTING 2007, VOL 1 AND 2, P1667
  • [5] [Anonymous], 2005, J NATURAL LANGUAGE P, DOI DOI 10.5715/JNLP.12.3_203
  • [6] [Anonymous], 2001, P 12 EUR C MACH LEAR, DOI DOI 10.1007/3-540-44795-4_42
  • [7] [Anonymous], P 17 ACM C INF KNOWL
  • [8] Banko M, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2670
  • [9] Berland M., 1999, P 37 ANN M ASS COMPU, P57, DOI DOI 10.3115/1034678.1034697
  • [10] Berners-Lee T., 2001, The Semantic Web