Aggregate operators in probabilistic databases

被引:46
作者
Ross, R [1 ]
Subrahmanian, VS
Grant, J
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[2] Towson Univ, Dept Comp & Informat Sci, Towson, MD USA
[3] Towson Univ, Dept Math, Towson, MD USA
关键词
algorithms; performance; theory; aggregates; probabilistic relational databases;
D O I
10.1145/1044731.1044734
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Though extensions to the relational data model have been proposed in order to handle probabilistic information, there has been very little work to date on handling aggregate operators in such databases. In this article, we present a very general notion of an aggregate operator and show how classical aggregation operators (such as COUNT, SUM, etc.) as well as statistical operators (such as percentiles, variance, etc.) are special cases of this general definition. We devise a formal linear programming based semantics for computing aggregates over probabilistic DBMSs, develop algorithms that satisfy this semantics, analyze their complexity, and introduce several families of approximation algorithms that run in polynomial time. We implemented all of these algorithms and tested them on a large set of data to help determine when each one is preferable.
引用
收藏
页码:54 / 101
页数:48
相关论文
共 37 条
[1]  
Aitchison J., 1975, Statistical Prediction Analysis
[2]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[3]   THE MANAGEMENT OF PROBABILISTIC DATA [J].
BARBARA, D ;
GARCIAMOLINA, H ;
PORTER, D .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1992, 4 (05) :487-502
[4]   Action recognition using probabilistic parsing [J].
Bobick, AF ;
Ivanov, YA .
1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :196-202
[5]  
Cavallo R., 1987, Proceedings of the Thirteenth International Conference on Very Large Data Bases: 1987 13th VLDB, P71
[6]  
Cheng R., 2003, P 2003 ACM SIGMOD IN, P551, DOI DOI 10.1145/872757
[7]  
CORMEN TH, 1992, INTRO ALGORITHMS
[8]  
Cramton P, 1997, J ECON MANAGE STRAT, V6, P431
[9]   Probabilistic temporal databases, I: Algebra [J].
Dekhtyar, A ;
Ross, R ;
Subrahmanian, VS .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2001, 26 (01) :41-95
[10]  
Dekhtyar A, 1997, LOGIC PROGRAMM, P391