Indexing the solution space: A new technique for nearest neighbor search in high-dimensional space

被引:43
作者
Berchtold, S
Keim, DA
Kriegel, HP
Seidl, T
机构
[1] Stb Software Technol Beratung GMBH, D-86150 Augsburg, Germany
[2] Univ Halle Wittenberg, Inst Comp Sci, D-06099 Halle An Der Saale, Germany
[3] Univ Munich, Inst Comp Sci, D-80538 Munich, Germany
关键词
nearest neighbor search; high-dimensional indexing; efficient query processing; spatial databases; Voronoi diagrams;
D O I
10.1109/69.842249
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Similarity search in multimedia databases requires an efficient support of nearest-neighbor search on a large set of high-dimensional points as a basic operation for query processing. As recent theoretical results show, state of the art approaches to nearest-neighbor search are not efficient in higher dimensions. In our new approach, we therefore precompute the result of any nearest-neighbor search which corresponds to a computation of the Voronoi cell of each data point. In a second step, we store conservative approximations of the Voronoi cells in an index structure efficient for high-dimensional data spaces. As a result, nearest neighbor search corresponds to a simple point query on the index structure. Although our technique is based on a precomputation of the solution space, it is dynamic, i.e., it supports insertions of new data points. An extensive experimental evaluation of our technique demonstrates the high efficiency for uniformly distributed as well as real data. We obtained a significant reduction of the search time compared to nearest neighbor search in other index structures such as the X-tree.
引用
收藏
页码:45 / 57
页数:13
相关论文
共 33 条
[1]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[2]  
[Anonymous], P 1998 ACM SIGMOD IN
[3]  
ARYA S, 1995, THESIS U MARYLAND CO
[4]   Enclosing a set of objects by two minimum area rectangles [J].
Becker, B ;
Franciosa, PG ;
Gschwind, S ;
Leonardi, S ;
Ohler, T ;
Widmayer, P .
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 1996, 21 (03) :520-541
[5]  
BECKER B, 1992, P 9 ANN S THEOR ASP, P475
[6]  
BECKMANN N, 1990, SIGMOD REC, V19, P322, DOI 10.1145/93605.98741
[7]  
Berchtold S, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P28
[8]  
Berchtold S., 1997, Proceedings of the Sixteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, PODS 1997, P78, DOI 10.1145/263661.263671
[9]  
BERCHTOLD S, 1997, P ACM SIGMOD INT C M, P1
[10]  
BERCHTOLD S, 1998, P 6 INT C EXT DAT TE