Ranked join indices

被引:34
作者
Tsaparas, P [1 ]
Palpanas, T [1 ]
Kotidis, Y [1 ]
Koudas, N [1 ]
Srivastava, D [1 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
来源
19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS | 2003年
关键词
D O I
10.1109/ICDE.2003.1260799
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A plethora of data sources contain data entities that could be ordered according to a variety of attributes associated with the entities. Such orderings result effectively in a ranking of the entities according to the values in the attribute domain. Commonly, users correlate such sources for query processing purposes through join operations. In query processing, it is desirable to incorporate user preferences towards specific attributes or their values. A way to incorporate such preferences is by utilizing scoring functions that combine user preferences and attribute values and return a numerical score for each tuple in the join result. Then, a target query, which we refer to as top-k join query, seeks to identify the k tuples in the join result with the highest scores. In this paper we propose a novel technique, which we refer to as ranked join index, to efficiently answer top-k join queries for arbitrary, user specified, preferences and a large class of scoring functions. Our rank join index requires small space (compared to the entire join result) and provides guarantees for its performance. Moreover, our proposal provides a graceful tradeoff between its space requirements and worst case search performance. We supplement our analytical results with a thorough experimental evaluation using a variety of real and synthetic data sets, demonstrating that, in comparison to other viable approaches, our technique offers significant performance benefits.
引用
收藏
页码:277 / 288
页数:12
相关论文
共 16 条
[1]  
AGRAWAL R, 2000, P ACM SIGMOD INT C M, P297, DOI DOI 10.1145/342009.335423
[2]  
BRUNO N, 2002, P ICDE APR
[3]  
CHANG K, 2002, P ACM SIGMOD JUN
[4]  
CHICHANG Y, 2000, P ACM SIGMOD JUN, P391
[5]  
DONJERKOVIC D, 1999, P VLDB AUG
[6]  
Fagin R., 1996, Proceedings of the Fifteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. PODS 1996, P216, DOI 10.1145/237661.237715
[7]  
Fagin R., 1998, Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. PODS 1998, P1, DOI 10.1145/275487.275488
[8]  
FAGIN R, 1997, ICDT, P247
[9]  
Gravano L., 1999, P VLDB AUG
[10]  
Guttman A., 1984, P ACM SIGMOD INT C M, P47, DOI DOI 10.1145/602259.602266