Efficient Sort-Based Skyline Evaluation

被引:130
作者
Bartolini, Ilaria [1 ]
Ciaccia, Paolo [1 ]
Patella, Marco [1 ]
机构
[1] Univ Bologna, DEIS, Alma Mater Studiorum, I-40136 Bologna, Italy
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2008年 / 33卷 / 04期
关键词
Algorithms; Performance; Skyline query; Monotone functions;
D O I
10.1145/1412331.1412343
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Skyline queries compute the set of Pareto-optimal tuples in a relation, that is, those tuples that are not dominated by any other tuple in the same relation. Although several algorithms have been proposed for efficiently evaluating skyline queries, they either necessitate the relation to have been indexed or have to perform the dominance tests on all the tuples in order to determine the result. In this article we introduce SaLSa, a novel skyline algorithm that exploits the idea of presorting the input data so as to effectively limit the number of tuples to be read and compared. This makes SaLSa also attractive when skyline queries are executed on top of systems that do not understand skyline semantics, or when the skyline logic runs on clients with limited power and/or bandwidth. We prove that, if one considers symmetric sorting functions, the number of tuples to be read is minimized by sorting data according to a "minimum coordinate," minC, criterion, and that performance can be further improved if data distribution is known and an asymmetric sorting function is used. Experimental results obtained on synthetic and real datasets show that SaLSa consistently outperforms state-of-the-art sequential skyline algorithms and that its performance can be accurately predicted.
引用
收藏
页数:49
相关论文
共 24 条
[1]  
Acharya S, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P13, DOI 10.1145/304181.304184
[2]  
[Anonymous], 2006, PROC 22TH INT C DATA
[3]  
Balke WT, 2004, LECT NOTES COMPUT SC, V2992, P256
[4]   Flexible integration of multimedia sub-queries with qualitative preferences [J].
Ilaria Bartolini ;
Paolo Ciaccia ;
Vincent Oria ;
M. Tamer Özsu .
Multimedia Tools and Applications, 2007, 33 (3) :275-300
[5]  
Bartolini I., 2006, PROC ACM INT C INF K, P405
[6]  
BARTOLINI I, 2004, LECT NOTES COMPUTER, V2992, P66
[7]   The Skyline operator [J].
Börzsönyi, S ;
Kossmann, D ;
Stocker, K .
17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, :421-430
[8]   ON THE AVERAGE NUMBER OF MAXIMA IN A SET OF VECTORS [J].
BUCHTA, C .
INFORMATION PROCESSING LETTERS, 1989, 33 (02) :63-65
[9]  
CHAUDHURI S, 2006, P INT C DAT ENG ICDE, P64
[10]   Preference formulas in relational queries [J].
Chomicki, J .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2003, 28 (04) :427-466