From XML Schema to Relations: A Cost-Based Approach to XML Storage

Cited by: 81
Authors
Bohannon, P [1]
Freire, J [1]
Roy, P [1]
Siméon, J [1]
Affiliations
[1] Bell Labs, Murray Hill, NJ 07974 USA
Source
18th International Conference on Data Engineering, Proceedings | 2002
DOI
10.1109/ICDE.2002.994698
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
As Web applications manipulate an increasing amount of XML, there is a growing interest in storing XML data in relational databases. Due to the mismatch between the complexity of XML's tree structure and the simplicity of flat relational tables, there are many ways to store the same document in an RDBMS, and a number of heuristic techniques have been proposed. These techniques typically define fixed mappings and do not take application characteristics into account. However, a fixed mapping is unlikely to work well for all possible applications. In contrast, LegoDB is a cost-based XML storage mapping engine that explores a space of possible XML-to-relational mappings and selects the best mapping for a given application. LegoDB leverages current XML and relational technologies: 1) it models the target application with an XML Schema, XML data statistics, and an XQuery workload; 2) the space of configurations is generated through XML-Schema rewritings; and 3) the best among the derived configurations is selected using cost estimates obtained through a standard relational optimizer. In this paper, we describe the LegoDB storage engine and provide experimental results that demonstrate the effectiveness of this approach.
Pages: 64-75
Number of pages: 12