Efficient maintenance of materialized top-k views

被引:46
作者
Yi, K [1 ]
Yu, H [1 ]
Yang, J [1 ]
Xia, GQ [1 ]
Chen, YG [1 ]
机构
[1] Duke Univ, Dept Comp Sci, Durham, NC 27706 USA
来源
19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS | 2003年
关键词
D O I
10.1109/ICDE.2003.1260792
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle the problem of maintaining materialized top-k views in this paper Top-k queries, including MIN and MAX as important special cases, occur frequently in common database workloads. A top-k view can be materialized to improve query performance, but in general it is not self-maintainable unless it contains all tuples in the base table. Deletions and updates on the base table may cause tuples to leave the top-k view, resulting in expensive queries over the base table to "refill" the view. In this paper we propose an algorithm that reduces the frequency of refills by maintaining a top-k' view instead of a top-k view, where k' changes at runtime between k and some k(max) greater than or equal to k. we show that in most practical cases, our algorithm can reduce the expected amortized cost of refill queries to 0(1) while still keeping the view small The optimal value of k(max) a depends on the update pattern and the costs of querying the base table and updating the view. Compared with the simple approach of maintaining either the top-k view itself or a copy of the base table, our algorithm can provide orders-of magnitude improvements in performance with appropriate kmax values. We show how to choose k(max) dynamically to adapt to the actual system workload and performance at runtime, without requiring accurate prior knowledge.
引用
收藏
页码:189 / 200
页数:12
相关论文
共 25 条
[1]  
AKINDE MO, 1998, P 1998 INT C EXT DAT
[2]  
[Anonymous], MATERIALIZED VIEWS T
[3]   UPDATING DERIVED RELATIONS - DETECTING IRRELEVANT AND AUTONOMOUSLY COMPUTABLE UPDATES [J].
BLAKELEY, JA ;
COBURN, N ;
LARSON, PA .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1989, 14 (03) :369-400
[4]  
BRUNO N, 2002, ACM T DATABASE SYSTE
[5]  
Carey M. J., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P158
[6]  
CAREY MJ, 1997, P 1997 ACM SIGMOD IN
[7]  
CHANG KCC, 2002, P 2002 ACM SIGMOD IN
[8]  
CHEN CM, 2002, P 2002 INT C DAT ENG
[9]   INSERTING A NEW ELEMENT INTO A HEAP [J].
DOBERKAT, EE .
BIT, 1981, 21 (03) :255-269
[10]  
Donjerkovic D, 1999, PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P411