Efficient bulk operations on dynamic R-trees

被引:55
作者
Arge, L [1 ]
Hinrichs, KH
Vahrenhold, J
Vitter, JS
机构
[1] Duke Univ, Dept Comp Sci, Ctr Geometr Comp, Durham, NC 27708 USA
[2] Univ Munster, Inst Informat, D-48149 Munster, Germany
关键词
external memory; I/O-efficient; R-tree; bulk loading; bulk operations;
D O I
10.1007/s00453-001-0107-6
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years there has been an upsurge of interest in spatial databases, A major issue is how to manipulate efficiently massive amounts of spatial data stored on disk in multidimensional spatial indexes (data structures). Construction of spatial indexes (bulk loading) has been studied intensively in the database community. The continuous arrival of massive amounts of new data makes it important to update existing indexes (bulk updating) efficiently. In this paper we present a simple, yet efficient, technique for performing bulk update and query operations on multidimensional indexes. We present our technique in terms of the so-called R-tree and its variants, as they have emerged as practically efficient indexing methods for spatial data. Our method uses ideas from the buffer tree lazy buffering technique and fully utilizes the available internal memory and the page size of the operating system. We give a theoretical analysis of our technique, showing that it is efficient both in terms of I/O communication, disk storage, and internal computation time. We also present the results of an extensive set of experiments showing that in practice our approach performs better than the previously best known bulk update methods with respect to update time, and that it produces a better quality index in terms of query performance. One important novel feature of our technique is that in most cases it allows us to perform a batch of updates and queries simultaneously. To be able to do so is essential in environments where queries have to be answered even while the index is being updated and reorganized.
引用
收藏
页码:104 / 128
页数:25
相关论文
共 34 条
[1]   THE INPUT OUTPUT COMPLEXITY OF SORTING AND RELATED PROBLEMS [J].
AGGARWAL, A ;
VITTER, JS .
COMMUNICATIONS OF THE ACM, 1988, 31 (09) :1116-1127
[2]  
Arge L, 1995, LECT NOTES COMPUT SC, V955, P334
[3]  
ARGE L, 1997, LNCS, V1340, P213
[4]  
Arge L., 1999, TPIE User Manual and Reference (edition 0.9.01a)
[5]  
ARGE L, 1996, THESIS AARHUS U DENM
[6]  
ARGE L, 2002, HDB MASSIVE DATA SET
[7]  
Bayer R., 1972, Acta Informatica, V1, P173, DOI 10.1007/BF00288683
[8]  
Becker B., 1996, VLDB Journal, V5, P264, DOI 10.1007/s007780050028
[9]  
BECKMANN N, 1990, SIGMOD REC, V19, P322
[10]  
Berchtold S, 1998, LECT NOTES COMPUT SC, V1377, P216