SPY-TEC: An efficient indexing method for similarity search in high-dimensional data spaces

被引:52
作者
Lee, DH [1 ]
Kim, HJ [1 ]
机构
[1] Seoul Natl Univ, Dept Comp Engn, OOPSLA Lab, Seoul 151742, South Korea
关键词
similarity search; high-dimensional index technique; multimedia database;
D O I
10.1016/S0169-023X(00)00009-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of all index structures based on the R-tree have failed to support efficient indexing mechanisms for similarity search in high-dimensional data spaces. This is due to the fact that most of the index structures commonly use balanced split strategy in order to guarantee storage utilization and the shape of queries for similarity search is a hypersphere in high-dimensional spaces. In this paper, we propose the Spherical Pyramid-Technique (SPY-TEC), an efficient indexing method for similarity search in high-dimensional data space. The SPY-TEC is based on a special partitioning strategy, which is to divide the ri-dimensional data space first into 2d spherical pyramids, and then cut the single spherical pyramid into several spherical slices. This partition provides a transformation of d-dimensional space into one-dimensional space as the Pyramid-Technique [14] does. Thus, we are able to use a B(+)-tree to manage the transformed one-dimensional data. We also propose the algorithms to process hyperspherical range queries on the data space partitioned by this partitioning strategy. Finally, we show that the SPY-TEC clearly outperforms other related techniques including the Pyramid-Technique in processing hyperspherical range queries through various experiments using synthetic and real data. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:77 / 97
页数:21
相关论文
共 17 条
[1]  
[Anonymous], P ACM SIGMOD INT C M
[2]  
BECKMANN N, 1990, SIGMOD REC, V19, P322, DOI 10.1145/93605.98741
[3]  
Berchtold S, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P28
[4]   Fast nearest neighbor search in high-dimensional space [J].
Berchtold, S ;
Ertl, B ;
Keim, DA ;
Kriegel, HP ;
Seidl, T .
14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1998, :209-218
[5]  
BERCHTOLD S, 1997, ACM PODS S PRINC DAT, P78
[6]  
Beyer Kevin., 1999, INT C DATABASE THEOR, P217, DOI DOI 10.1007/3-540-49257-7_15
[7]  
Faloutsos C., 1994, Journal of Intelligent Information Systems: Integrating Artificial Intelligence and Database Technologies, V3, P231, DOI 10.1007/BF00962238
[8]  
FALOUTSOS C, 1995, DATA ENG B, V18, P31
[9]  
JACOBS CE, 1995, P 1995 ACM SIGGRAPH
[10]  
Kamel I, 1994, Proc. International Conference on Very Large Databases, P500