A methodology for knowledge acquisition from the web

被引:22
作者
Sanchez, David [1 ]
Moreno, Antonio [1 ]
机构
[1] Univ Rovira & Virgili, Dept Comp Sci & Math DEIM, Avda Paisos Catalans 26, E-43007 Tarragona, Spain
关键词
D O I
10.3233/KES-2006-10605
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accessing up-to-date information in a fast and easy way implies the necessity of information management tools to explore and analyse the huge number of available electronic resources. The Web offers a large amount of valuable information for every possible domain, but its human-oriented representation and its size makes difficult and extremely time consuming any kind of centralised computer-based processing. In this paper, a combination of distributed AI and knowledge acquisition techniques is proposed to tackle this problem. In particular, we have designed an incremental and domain independent learning methodology modelled over a multi-agent system that crawls the Web composing knowledge structures (ontologies) from the interrelation of several automatically obtained taxonomies of terms according to the user's interests. Moreover, the obtained ontologies are used to represent, in a structured way, the currently available web resources for the corresponding domain. The paper also presents examples of the potential results over medical and technological domains and compares the results, whenever it is possible, against publicly available taxonomic web search engines obtaining, in all cases, a considerable improvement.
引用
收藏
页码:453 / 475
页数:23
相关论文
共 47 条
[1]  
AGIRRE E, 2000, P WORKSH ONT CONSTR
[2]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[3]  
ALFONSECA E, 2002, P 1 INT C GEN WORDNE
[4]  
Aussenac-Gilles N., 2000, 12 INT C EKAW 2000 J
[5]   The Semantic Web - A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities [J].
Berners-Lee, T ;
Hendler, J ;
Lassila, O .
SCIENTIFIC AMERICAN, 2001, 284 (05) :34-+
[6]  
Borthwick Andrew, 1999, THESIS
[7]  
Brill E., 2001, P 10 TEXT RETR C TRE
[8]  
Califf ME, 1999, SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), P328
[9]  
CILIBRASI RL, 2004, AUTOMATIC MEANING DI
[10]  
Cimiano P., 2004, ACM SIGKDD EXPLORATI, V6, P24, DOI [DOI 10.1145/1046456.1046460, 10.1145/1046456.1046460]