Space-Time Tradeoffs for Approximate Nearest Neighbor Searching

被引:42
作者
Arya, Sunil [1 ]
Malamatos, Theocharis [2 ]
Mount, David M. [3 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[2] Univ Peloponnese, Dept Comp Sci & Technol, Tripoli 22100, Libya
[3] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
Algorithms; Theory; Nearest neighbor searching; space-time tradeoffs; MULTIDIMENSIONAL POINT SETS; DIMENSIONAL SPACES; TREES; QUERIES;
D O I
10.1145/1613676.1613677
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nearest neighbor searching is the problem of preprocessing a set of n point points in d-dimensional space so that, given any query point q, it is possible to report the closest point to q rapidly. In approximate nearest neighbor searching, a parameter epsilon > 0 is given, and a multiplicative error of (1 + epsilon) is allowed. We assume that the dimension d is a constant and treat n and epsilon as asymptotic quantities. Numerous solutions have been proposed, ranging from low-space solutions having space O(n) and query time O(log n+1/epsilon(d-1)) to high-space solutions having space roughly O((n log n)/epsilon(d)) and query time O(log(n/epsilon)). We show that there is a single approach to this fundamental problem, which both improves upon existing results and spans the spectrum of space-time tradeoffs. Given a tradeoff parameter gamma, where 2 <= gamma <= 1/epsilon, we show that there exists a data structure of space O(n gamma(d-1) log(1/epsilon)) that can answer queries in time O(log(n gamma) + 1/(epsilon gamma)((d-1)/2)). When gamma = 2, this yields a data structure of space O(n log(1/epsilon)) that can answer queries in time O(log n + 1/epsilon((d-1)/2)). When gamma = 1/epsilon, it provides a data structure of space O((n/epsilon(d-1)) log(1/epsilon)) that can answer queries in time O(log(n/epsilon)). Our results are based on a data structure called a (t, epsilon)-AVD, which is a hierarchical quadtree-based subdivision of space into cells. Each cell stores up to t representative points of the set, such that for any query point q in the cell at least one of these points is an approximate nearest neighbor of q. We provide new algorithms for constructing AVDs and tools for analyzing their total space requirements. We also establish lower bounds on the space complexity of AVDs, and show that, up to a factor of O(log(1/epsilon)), our space bounds are asymptotically tight in the two extremes, gamma = 2 and gamma = 1/epsilon.
引用
收藏
页数:54
相关论文
共 45 条
[1]  
[Anonymous], 1987, ALGORITHMS COMBINATO
[2]  
[Anonymous], P 8 CAN C COMP GEOM
[3]  
Arya S, 2002, SIAM PROC S, P147
[4]   Approximate range searching [J].
Arya, S ;
Mount, DM .
COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2000, 17 (3-4) :135-152
[5]   An optimal algorithm for approximate nearest neighbor searching in fixed dimensions [J].
Arya, S ;
Mount, DM ;
Netanyahu, NS ;
Silverman, R ;
Wu, AY .
JOURNAL OF THE ACM, 1998, 45 (06) :891-923
[6]  
Arya S., 2006, P 38 ACM S THEOR COM, P564
[7]  
Arya S, 2008, LECT NOTES COMPUT SC, V5193, P112, DOI 10.1007/978-3-540-87744-8_10
[8]  
Arya S, 2005, PROCEEDINGS OF THE SIXTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P535
[9]   Tradeoffs in Approximate Range Searching Made Simpler [J].
Arya, Sunil ;
da Fonseca, Guilherme D. ;
Mount, David M. .
SIBGRAPI 2008: XXI BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, 2008, :237-+
[10]   The Effect of Corners on the Complexity of Approximate Range Searching [J].
Arya, Sunil ;
Malamatos, Theocharis ;
Mount, David M. .
DISCRETE & COMPUTATIONAL GEOMETRY, 2009, 41 (03) :398-443