A transform for multiscale image segmentation by integrated edge and region detection

Cited by: 65
Authors
Ahuja, N [1]
Affiliations
[1] UNIV ILLINOIS, DEPT ELECT & COMP ENGN, URBANA, IL 61801 USA
Funding
National Science Foundation (USA);
Keywords
image segmentation; representation; scale-space; edge detection; region detection; perceptual structure; pyramids; medial axis; nonlinear image analysis; texture;
DOI
10.1109/34.546258
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes a new transform to extract image regions at all geometric and photometric scales. It is argued that linear approaches such as convolution and matching have the fundamental shortcoming that they require a priori models of region shape. The proposed transform avoids this limitation by letting the structure emerge, bottom-up, from interactions among pixels, in analogy with statistical mechanics and particle physics. The transform involves global computations on pairs of pixels followed by vector integration of the results, rather than scalar and local linear processing. An attraction force field is computed over the image in which pixels belonging to the same region are mutually attracted and the region is characterized by a convergent flow. It is shown that the transform possesses properties that allow multiscale segmentation, or extraction of original, unblurred structure at all different geometric and photometric scales present in the image. This is in contrast with much of the previous work wherein multiscale structure is viewed as the smoothed structure in a multiscale decimation of the image signal. Scale is an integral parameter of the force computation, and the number and values of scale parameters associated with the image can be estimated automatically. Regions are detected at all scales, which are unknown a priori, resulting in automatic construction of a segmentation tree, in which each pixel is annotated with descriptions of all the regions it belongs to. Although some of the analytical properties of the transform are presented for piecewise constant images, it is shown that the results hold for more general images, e.g., those containing noise and shading. Thus the proposed method is intended as a solution to the problem of multiscale, integrated edge and region detection, or low-level image segmentation. Experimental results with synthetic and real images are given to demonstrate the properties and segmentation performance of the transform.
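As an illustration of the idea of a pairwise attraction force field, the following is a minimal sketch only. The abstract does not give the paper's force definition, so everything concrete here is an assumption: the function name `attraction_field`, the Gaussian weighting, and the two scale parameters `sigma_s` (geometric) and `sigma_g` (photometric) are hypothetical choices, not the author's transform. Each pixel receives the vector sum of unit vectors toward every other pixel, weighted so that nearby pixels of similar gray level attract most strongly.

```python
import math


def attraction_field(image, sigma_s, sigma_g):
    """Return a grid of (fy, fx) force vectors, one per pixel.

    ASSUMPTION: the weight is a product of two Gaussians, one on spatial
    distance (scale sigma_s) and one on gray-level difference (scale
    sigma_g). This is an illustrative stand-in, not the paper's formula.
    """
    h, w = len(image), len(image[0])
    field = [[(0.0, 0.0) for _ in range(w)] for _ in range(h)]
    for y in range(h):
        for x in range(w):
            fy = fx = 0.0
            # Global computation: interact with every other pixel.
            for v in range(h):
                for u in range(w):
                    if v == y and u == x:
                        continue
                    dy, dx = v - y, u - x
                    dist = math.hypot(dy, dx)
                    dg = image[v][u] - image[y][x]
                    weight = (math.exp(-dist ** 2 / (2 * sigma_s ** 2))
                              * math.exp(-dg ** 2 / (2 * sigma_g ** 2)))
                    # Vector integration: add the weighted unit vector.
                    fy += weight * dy / dist
                    fx += weight * dx / dist
            field[y][x] = (fy, fx)
    return field


# Usage: a step image with a dark left half and a bright right half.
step = [[0.0] * 3 + [1.0] * 3 for _ in range(6)]
forces = attraction_field(step, sigma_s=2.0, sigma_g=0.1)
```

In this sketch, pixels on either side of the step are pulled away from the edge and toward the interior of their own region, which is one simple way to picture the convergent flow mentioned in the abstract. The quadruple loop costs O(N^2) in the number of pixels, so it is only suitable for small test images.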
Pages: 1211-1235
Page count: 25