FAST ALGORITHMS FOR UNIVERSAL QUANTIFICATION IN LARGE DATABASES

被引:21
作者
GRAEFE, G [1 ]
COLE, RL [1 ]
机构
[1] PORTLAND STATE UNIV,PORTLAND,OR
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 1995年 / 20卷 / 02期
关键词
ALGORITHMS; EXPERIMENTATION;
D O I
10.1145/210197.210202
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Universal quantification is not supported directly in most database systems despite the fact that it adds significant power to a system's query processing and inference capabilities, in particular for the analysis of many-to-many relationships and of set-valued attributes. One of the main reasons for this omission has been that universal quantification algorithms and their performance have not been explored for large databases. In this article, we describe and compare three known algorithms and one recently proposed algorithm for relational division, the algebra operator that embodies universal quantification. For each algorithm, we investigate the performance effects of explicit duplicate removal and referential integrity enforcement, variants for inputs larger than memory, and parallel execution strategies. Analytical and experimental performance comparisons illustrate the substantial differences among the algorithms. Moreover, comparisons demonstrate that the recently proposed division algorithm evaluates a universal quantification predicate over two relations as fast as hash (semi-) join evaluates an existential quantification predicate over the same relations. Thus, existential and universal quantification can be supported with equal efficiency by adding the recently proposed algorithm to a query evaluation system. A second result of our study is that universal quantification should be expressed directly in a database query language because most query optimizers do not recognize the rather indirect formulations available in SQL as relational division, and therefore produce very poor evaluation plans for many universal quantification queries.
引用
收藏
页码:187 / 236
页数:50
相关论文
共 31 条
[1]  
Astrahan M.M., ACM Trans. Database Syst. 1, 2 ( June ), 97. Reprinted in Stonebraker, M., 1988 Readings in Database Systems, (1976)
[2]  
Duplicate record elimination in large data files, ACM Trans. Database Syst. 8, (1983)
[3]  
Bratbergsengen K., Proc. Inti. Conf. on Very Large Data Bases, (1984)
[4]  
Carlis J.V., HAS: A relational algebra operator, or divide is not enough to conquer, (1986)
[5]  
Cheng J., Loosely C., Shibamiya A., Worthington P., IBM database 2 performance: design, implementation, and tuning, IBM Syst. J. 23, (1985)
[6]  
Codd E.F., Relational Completeness of Database Sublanguages, (1972)
[7]  
Date C.J., Guide to The SQL Standard, (1993)
[8]  
Dewitt D.J., Katz R., Olken F., Shapiro L., Stonebraker M., Implementation techniques for main memory database systems, Proc. ACM SIGMOD Conf., 1, (1984)
[9]  
Dewitt D.J., Multiprocessor hash-based join algorithms, Proc. Inti. Conf. on Very Large Data Bases, (1985)
[10]  
Dewitt D.J., Proc. Inti Conf. on Very Large Data Bases (Kyoto, Japan, August ), 228. Reprinted in Stonebraker, M., 1988. Readings in Database Systems, (1986)