Internationalization of Linked Data: The case of the Greek DBpedia edition

Cited by: 20
Authors
Kontokostas, Dimitris [1]
Bratsas, Charalampos [1]
Auer, Soeren [2]
Hellmann, Sebastian [2]
Antoniou, Ioannis [1]
Metakides, George [1]
Affiliations
[1] Aristotle Univ Thessaloniki, Web Sci Program, Dept Math, Thessaloniki, Greece
[2] Univ Leipzig, Inst Informat, D-04109 Leipzig, Germany
Source
JOURNAL OF WEB SEMANTICS | 2012 / Vol. 15
Keywords
DBpedia; Multilingual; Internationalization; Linked Data; IRI; URI
DOI
10.1016/j.websem.2012.01.001
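Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
140502 [Artificial intelligence]
Abstract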
This paper describes the deployment of the Greek DBpedia and the contribution to the DBpedia information extraction framework with regard to internationalization (I18n) and multilingual support. I18n filters are proposed as pluggable components in order to address issues when extracting knowledge from non-English Wikipedia editions. We report on our strategy for supporting the Internationalized Resource Identifier (IRI) and introduce two new extractors to complement the I18n filters. Additionally, the paper discusses the definition of Transparent Content Negotiation (TCN) rules for IRIs to address dereferencing and IRI serialization problems. The aim of this research is to establish best practices (complemented by software) to allow the DBpedia community to easily generate, maintain and properly interlink language-specific DBpedia editions. Furthermore, these best practices can be applied for the publication of Linked Data in non-Latin languages in general. © 2012 Elsevier B.V. All rights reserved.
Pages: 51-61
Page count: 11
Related papers
22 entries in total
[1]
Adida Ben., 2008, RDFa in XHTML: Syntax and processing. Recommendation
[2]
[Anonymous], SCALE FREE TOPOLOGY
[3]
[Anonymous], 2008, N-quads: Extending n-triples with context
[4]
[Anonymous], P 5 OP KNOWL C
[5]
Auer S, 2010, LECT NOTES COMPUT SC, V6497, P1, DOI 10.1007/978-3-642-17749-1_1
[6]
BECKETT D, 2007, TURTLE TERSE RDF TRI
[7]
The Semantic Web - A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities [J].
Berners-Lee, T ;
Hendler, J ;
Lassila, O .
SCIENTIFIC AMERICAN, 2001, 284 (05) :34-+
[8]
Berners-Lee T., 2008, Notation3 (N3): A readable RDF syntax
[9]
Linked Data - The Story So Far [J].
Bizer, Christian ;
Heath, Tom ;
Berners-Lee, Tim .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) :1-22
[10]
DBpedia - A crystallization point for the Web of Data [J].
Bizer, Christian ;
Lehmann, Jens ;
Kobilarov, Georgi ;
Auer, Soeren ;
Becker, Christian ;
Cyganiak, Richard ;
Hellmann, Sebastian .
JOURNAL OF WEB SEMANTICS, 2009, 7 (03) :154-165