Range search on multidimensional uncertain data

Citations: 99
Authors
Tao, Yufei [1]
Xiao, Xiaokui
Cheng, Reynold
Affiliations
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
Source
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2007 / Vol. 32 / Issue 03
Keywords
algorithms; experimentation; uncertain databases; range search
DOI
10.1145/1272743.1272745
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In an uncertain database, every object o is associated with a probability density function, which describes the likelihood that o appears at each position in a multidimensional workspace. This article studies two types of range retrieval fundamental to many analytical tasks. Specifically, a nonfuzzy query returns all the objects that appear in a search region r(q) with at least a certain probability t(q). On the other hand, given an uncertain object q, fuzzy search retrieves the set of objects that are within distance epsilon(q) from q with no less than probability t(q). The core of our methodology is a novel concept of "probabilistically constrained rectangle", which permits effective pruning/validation of nonqualifying/qualifying data. We develop a new index structure called the U-tree for minimizing the query overhead. Our algorithmic findings are accompanied by a thorough theoretical analysis, which reveals valuable insight into the problem characteristics, and mathematically confirms the efficiency of our solutions. We verify the effectiveness of the proposed techniques with extensive experiments.
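The nonfuzzy query defined in the abstract can be illustrated with a brute-force sketch. This is not the U-tree or the probabilistically constrained rectangle technique from the article; it only shows the query semantics by estimating each object's appearance probability with Monte Carlo sampling. The function names, the uniform-box pdf, and the sample data are all hypothetical and chosen for illustration.

```python
import random


def appearance_probability(sample_pdf, region, n_samples=10_000, seed=0):
    """Monte Carlo estimate of the probability that an object lies in `region`.

    sample_pdf: callable taking a random.Random and returning a point (tuple).
    region: list of (low, high) intervals, one per dimension, i.e. r(q).
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_samples):
        point = sample_pdf(rng)
        if all(lo <= x <= hi for x, (lo, hi) in zip(point, region)):
            hits += 1
    return hits / n_samples


def nonfuzzy_range_query(objects, region, threshold, n_samples=10_000):
    """Return ids of objects whose estimated probability of appearing
    in `region` is at least `threshold`, i.e. t(q)."""
    return [
        oid
        for oid, sample_pdf in objects.items()
        if appearance_probability(sample_pdf, region, n_samples) >= threshold
    ]


def uniform_box(box):
    """Hypothetical pdf: uniform over an axis-aligned box (an assumption,
    not a distribution prescribed by the article)."""
    return lambda rng: tuple(rng.uniform(lo, hi) for lo, hi in box)


# Illustrative data only: three uncertain objects in a 2D workspace.
objects = {
    "a": uniform_box([(0, 1), (0, 1)]),      # entirely inside the region
    "b": uniform_box([(0.5, 1.5), (0, 1)]),  # about half inside the region
    "c": uniform_box([(2, 3), (2, 3)]),      # entirely outside the region
}
region = [(0, 1), (0, 1)]
result = nonfuzzy_range_query(objects, region, threshold=0.8)
```

Scanning every object and sampling its pdf is exactly the per-query cost that an index with pruning and validation rules, such as the one the abstract describes, is meant to avoid.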
Pages: 54