Efficient Processing of Top-k Queries in Uncertain Databases with x-Relations

被引:58
作者
Yi, Ke [1 ]
Li, Feifei [2 ]
Kollios, George [3 ]
Srivastava, Divesh [4 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[2] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA
[3] Boston Univ, Dept Comp Sci, Boston, MA 02215 USA
[4] AT&T Labs Res, Florham Pk, NJ 07932 USA
基金
美国国家科学基金会;
关键词
Algorithm; probabilistic data; query processing; top-k; uncertain database; x-relation;
D O I
10.1109/TKDE.2008.90
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work introduces novel polynomial algorithms for processing top-k queries in uncertain databases under the generally adopted model of x-relations. An x-relation consists of a number of x-tuples, and each x-tuple randomly instantiates into one tuple from one or more alternatives. Our results significantly improve the best known algorithms for top-k query processing in uncertain databases, in terms of both runtime and memory usage. In the single-alternative case, the new algorithms are 2 to 3 orders of magnitude faster than the previous algorithms. In the multialternative case, we introduce the first-known polynomial algorithms, while the current best algorithms have exponential complexity in both time and space. Our algorithms run in near linear or low polynomial time and cover both types of top-k queries in uncertain databases. We provide both the theoretical analysis and an extensive experimental evaluation to demonstrate the superiority of the new approaches over existing solutions.
引用
收藏
页码:1669 / 1682
页数:14
相关论文
共 29 条
[1]  
ABITEBOUL S, 1987, P ACM SIGMOD
[2]  
Agrawal P., 2006, P VLDB 2006 DEM DESC
[3]  
[Anonymous], P ACM SIGMOD
[4]  
[Anonymous], 1998, 1998 WORLD CUP WEB S
[5]  
ANTOVA L, 2007, P ACM SIGMOD
[6]  
Antova L., 2007, P IEEE INT C DAT ENG
[7]   THE MANAGEMENT OF PROBABILISTIC DATA [J].
BARBARA, D ;
GARCIAMOLINA, H ;
PORTER, D .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1992, 4 (05) :487-502
[8]  
Benjelloun O, 2006, P VER LARG DAT BAS
[9]  
Chaudhuri S., 2003, P ACM SIGMOD
[10]  
Dalvi Nilesh N., 2004, P INT C VER LARG DAT