Approximate nearest neighbor searching in multimedia databases

被引:59
作者
Ferhatosmanoglu, H [1 ]
Tuncel, E [1 ]
Agrawal, D [1 ]
El Abbadi, A [1 ]
机构
[1] Univ Calif Santa Barbara, Santa Barbara, CA 93106 USA
来源
17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS | 2001年
关键词
D O I
10.1109/ICDE.2001.914864
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper; we develop a general framework for approximate nearest neighbor queries. We categorize the current approaches for nearest neighbor query processing based on either their ability to reduce the data set that needs to be examined, or their ability to reduce the representation size of each data object We first propose modifications to well-known techniques to support the progressive processing of approximate nearest neighbor queries. A user may therefore stop the retrieval process once enough information has been returned. We then develop a new technique based on clustering that merges the benefits of the two general classes of approaches. Our cluster-based approach allows a user to progressively explore the approximate results with increasing accuracy. We propose a new metric for evaluation of approximate nearest neighbor searching techniques. Using both the proposed and the traditional metrics, ute analyze and compare several techniques with a detailed performance evaluation. We demonstrate the feasibility and efficiency of approximate nearest neighbor searching. We perform experiments on several real data sets and establish the superiority of the proposed cluster-based technique over the existing techniques for approximate nearest neighbor searching.
引用
收藏
页码:503 / 511
页数:9
相关论文
共 17 条
[1]  
Agrawal R., 1993, Foundations of Data Organization and Algorithms. 4th International Conference. FODO '93 Proceedings, P69
[2]  
Arya S., 1995, Proceedings of the Eleventh Annual Symposium on Computational Geometry, P172, DOI 10.1145/220279.220298
[3]  
BERNSTEIN P, 1998, ACM SIGMOD RECORD, V27
[4]  
Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
[5]  
Ciaccia P., 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073), P244, DOI 10.1109/ICDE.2000.839417
[6]  
Egecioglu O., 2000, Proceedings of the Ninth International Conference on Information and Knowledge Management. CIKM 2000, P219, DOI 10.1145/354756.354822
[7]  
Ferhatosmanoglu H., 2000, Proceedings of the Ninth International Conference on Information and Knowledge Management. CIKM 2000, P202, DOI 10.1145/354756.354820
[8]  
FERHATOSMANOGLU H, 2000, TRCS0024 UC COMP SCI
[9]   Multidimensional access methods [J].
Gaede, V ;
Gunther, O .
ACM COMPUTING SURVEYS, 1998, 30 (02) :170-231
[10]  
Gionis A, 1999, PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P518