Representing and reasoning on XML documents: A description logic approach

被引:54
作者
Calvanese, D [1 ]
De Giacomo, G [1 ]
Lenzerini, M [1 ]
机构
[1] Univ Roma La Sapienza, Dipartimento Informat & Sistemist, I-00198 Rome, Italy
关键词
knowledge representation; automated reasoning; description logics; XML; SGML;
D O I
10.1093/logcom/9.3.295
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recent proposals to improve the quality of interaction with the World Wide Web suggest considering the Web as a huge semistructured database, so that retrieving information can be supported by the task of database querying. Under this view, it is important to represent the form of both the network, and:the: documents placed in the nodes of the network. However, the current proposals do not pay sufficient attention to represent document structures and reasoning about them. In this paper, we address these problems by providing a framework where Document Type Definitions (DTDs) expressed in the extensible Markup Language (XML) are formalized in an expressive Description Logic equipped with sound and complete inference algorithms. We provide methods for verifying conformance of a-document to a DTD in polynomial time, and structural equivalence of DTDs in worst case deterministic exponential time. improving known algorithms for this problem which were double exponential. We also deal with parametric versions of conformance and structural equivalence, and investigate other forms of reasoning on DTDs. Finally, we show how to take advantage of the reasoning capabilities of our formalism in order to perform several optimization steps in answering queries posed to a document base.
引用
收藏
页码:295 / 318
页数:24
相关论文
共 43 条
[1]   Querying documents in object databases [J].
Abiteboul S. ;
Cluet S. ;
Christophides V. ;
Milo T. ;
Moerkotte G. ;
Siméon J. .
International Journal on Digital Libraries, 1997, 1 (1) :5-19
[2]  
Abiteboul S., 1995, Foundations of databases, V1st
[3]  
[Anonymous], LECT NOTES ARTIFICIA
[4]  
[Anonymous], 1996, PRINC KNOWL REPRESEN
[5]   Part-whole relations in object-centered systems: An overview [J].
Artale, A ;
Franconi, E ;
Guarino, N ;
Pazzi, L .
DATA & KNOWLEDGE ENGINEERING, 1996, 20 (03) :347-383
[6]  
ARTALE A, 1996, P 1996 DESCR LOG WOR, P70
[7]   Using automata theory for characterizing the semantics of terminological cycles [J].
Baader, F .
ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 1996, 18 (2-4) :175-219
[8]  
BAADER F, 1991, P 12 INT JIONT C ART
[9]  
Bray Tim, 1998, Extensible markup language
[10]  
Calvanese D, 1996, MOR KAUF R, P292