Text mining techniques to automatically enrich a domain ontology

被引:48
作者
Missikoff, M
Velardi, P
Fabriani, P
机构
[1] CNR, Ist Anal Sistemi & Informat, I-00185 Rome, Italy
[2] Univ Roma La Sapienza, DSI, Rome, Italy
关键词
ontology; text-mining; terminology; ontology management system; natural language processing;
D O I
10.1023/A:1023254205945
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
Though the utility of domain ontologies is now widely acknowledged in the IT (Information Technology) community, several barriers must be overcome before ontologies become practical and useful tools. A critical issue is the ontology construction, i.e., the task of identifying, defining, and entering the concept definitions. In case of large and complex application domains this task can be lengthy, costly, and controversial (since different persons may have different points of view about the same concept). To reduce time, cost (and, sometimes, harsh discussions) it is highly advisable to refer, in constructing or updating an ontology, to the documents available in the field. Text mining tools may be of great help in this task. The work presented in this paper illustrates the guidelines of SymOntos, ontology management system, and the text mining approach adopted herein to support ontology building. The latter operates by extracting, from the related literature, the prominent domain concepts and the semantic relations among them.
引用
收藏
页码:323 / 340
页数:18
相关论文
共 29 条
[1]
AGIRRE E, 2000, P ECAI 2000 WORKSH O
[2]
[Anonymous], 1992, P 3 C APPL NAT LANG
[3]
An empirical symbolic approach to natural language processing [J].
Basili, R ;
Pazienza, MT ;
Velardi, P .
ARTIFICIAL INTELLIGENCE, 1996, 85 (1-2) :59-99
[4]
BASILI R, 2000, P INT WORKSH PARS TE
[5]
Basili Roberto, 1997, P 2 C EMP METH NAT L
[6]
BRACHMAN R, 1979, ASS NETWORKS REPRESE
[7]
CUCCHIARELLI A, 2000, P 23 ANN SIGIR ATH G
[8]
CUCCHIARELLI A, 1998, NATURAL LANGUAGE DEC
[9]
DAILLE B, 1994, P ACL94 WORKSH BAL A
[10]
FANO R, 1961, TRASMISSION INFORMAT