YAGO: A Large Ontology from Wikipedia and WordNet

Cited by: 940
Authors
Suchanek, Fabian M. [1]
Kasneci, Gjergji [1]
Weikum, Gerhard [1]
Affiliations
[1] Max Planck Inst Comp Sci, Saarbrucken, Germany
Source
JOURNAL OF WEB SEMANTICS | 2008 / Vol. 6 / No. 03
Keywords
Ontologies; Information extraction; Knowledge representation;
DOI
10.1016/j.websem.2008.06.001
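Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 [Pattern Recognition and Intelligent Systems]; 0812 [Computer Science and Technology]; 0835 [Software Engineering]; 1405 [Intelligence Science and Technology];
Abstract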
This article presents YAGO, a large ontology with high coverage and precision. YAGO has been automatically derived from Wikipedia and WordNet. It comprises entities and relations, and currently contains more than 1.7 million entities and 15 million facts. These include the taxonomic Is-A hierarchy as well as semantic relations between entities. The facts for YAGO have been extracted from the category system and the infoboxes of Wikipedia and have been combined with taxonomic relations from WordNet. Type checking techniques help us keep YAGO's precision at 95%, as proven by an extensive evaluation study. YAGO is based on a clean logical model with a decidable consistency. Furthermore, it allows representing n-ary relations in a natural way while maintaining compatibility with RDFS. A powerful query model facilitates access to YAGO's data. (c) 2008 Published by Elsevier B.V.
Pages: 203-217
Page count: 15
Related papers
56 in total
[1]
AGICHTEIN E, 2000, SNOWBALL EXTRACTING
[2]
[Anonymous], 2007, IJCAI
[3]
[Anonymous], 2006, AAAI SPRING S
[4]
DBpedia: A nucleus for a web of open data [J].
Auer, Soeren ;
Bizer, Christian ;
Kobilarov, Georgi ;
Lehmann, Jens ;
Cyganiak, Richard ;
Ives, Zachary .
SEMANTIC WEB, PROCEEDINGS, 2007, 4825 :722-+
[5]
Baader F., 1998, TERM REWRITING ALL
[6]
Banko M, 2007, K-CAP'07: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON KNOWLEDGE CAPTURE, P95
[7]
BAST H, 2007, SIGIR, P671
[8]
BIRON PV, XML SCHEMA 2
[9]
Bizer C., 2008, WWW
[10]
Interval estimation for a binomial proportion - Comment - Rejoinder [J].
Brown, LD ;
Cai, TT ;
DasGupta, A ;
Agresti, A ;
Coull, BA ;
Casella, G ;
Corcoran, C ;
Mehta, C ;
Ghosh, M ;
Santner, TJ ;
Brown, LD ;
Cai, TT ;
DasGupta, A .
STATISTICAL SCIENCE, 2001, 16 (02) :101-133