DEFORMABLE KERNELS FOR EARLY VISION

被引:155
作者
PERONA, P [1 ]
机构
[1] UNIV PADUA, DIPARTIMENTO ELETRON & INFORMAT, PADUA, ITALY
基金
美国国家科学基金会;
关键词
STEERABLE FILTERS; WAVELETS; EARLY VISION; MULTIRESOLUTION IMAGE ANALYSIS; MULTIRATE FILTERING; DEFORMABLE FILTERS; SCALE-SPACE;
D O I
10.1109/34.391394
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Early vision algorithms often have a first stage of linear-filtering that 'extracts' from the image information at multiple scales of resolution and multiple orientations. A common difficulty in the design and implementation of such schemes is that one feels compelled to discretize coarsely the space of scales and orientations in order to reduce computation and storage costs. This discretization produces anisotropies due to a loss of translation-, rotation-, and scaling-invariance that makes early vision algorithms less precise and more difficult to design. This need not be so: one can compute and store efficiently the response of families of linear filters defined on a continuum of orientations and scales. A technique is presented that allows 1) computing the best approximation of a given family using linear combinations of a small number of 'basis' functions; 2) describing all finite-dimensional families, i.e., the families of filters for which a finite dimensional representation is possible with no error. The technique is based on singular value decomposition and may be applied to generating filters in arbitrary dimensions and subject to arbitrary deformations; the relevant functional analysis results are reviewed and precise conditions for the decomposition to be feasible are stated. Experimental results are presented that demonstrate the applicability of the technique to generating multi-orientation multi-scale 2D edge-detection kernels. The implementation issues are also discussed.
引用
收藏
页码:488 / 499
页数:12
相关论文
共 66 条
[1]  
Hubel D., Wiesel T., Receptive fields, binocular interaction and functional architecture in the cat's visual cortex., J. Physiol. (Lond.), 160, pp. 106-154, (1962)
[2]  
Hubel D., Wiesel T., Receptive fields of single neurones in the cat's striate cortex., J. Physiol. (Lond.), 148, pp. 574-591, (1959)
[3]  
Horn B., “The binford-horn linefinder,”, (1971)
[4]  
Binford T., Inferring surfaces from images, Artificial Intelligence, 17, pp. 205-244, (1981)
[5]  
Granlund G.H., In search of a general picture processing operator, Computer Graphics and Image Processing, 8, pp. 155-173, (1978)
[6]  
Knuttson H., Granlund G.H., Texture analysis using two-dimensiona quadrature filters, Workshop on Computer Architecture for Pattern Analysis and Image Database Management, pp. 206-213, (1983)
[7]  
Kass M., Computing visual correspondence, Proc. Image Understanding Workshop, (McLean, Va.), pp. 54-60, (1983)
[8]  
Burt P., Adelson E., The laplacian pyramid as a compact image code, IEEE Trans. Communications, 31, pp. 532-540, (1983)
[9]  
Adelson E., Bergen J., Spatiotemporal energy models for the perception of motion, J. Opt. Soc. Am., 2, 2, pp. 284-299, (1985)
[10]  
Canny J., A computational approach to edge detection, IEEE Trans. Pattern Analysis and Machine Intelligence, 8, pp. 679-698, (1986)