A COMPLETE AND EXTENDIBLE APPROACH TO VISUAL RECOGNITION

被引:5
作者
BOLLE, RM [1 ]
CALIFANO, A [1 ]
KJELDSEN, R [1 ]
机构
[1] IBM CORP,THOMAS J WATSON RES CTR,DEPT COMP SCI,DEPT ARTIFICIAL INTELLIGENCE,YORKTOWN HTS,NY 10598
关键词
CONNECTIONIST NETWORKS; CONSTRAINT SATISFACTION NETWORKS; FEATURES; HOUGH TRANSFORM; INDEXING; OBJECT MATCHING; OBJECT MODELING; OBJECT RECOGNITION; PARAMETER TRANSFORMS; RANGE DATA;
D O I
10.1109/34.134058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a framework for 3-D object recognition. An important aspect of this framework is its flexibility and extendibility, which is accomplished through a uniform, parallel, and modular recognition architecture. Concurrent and stacked parameter transforms reconstruct a variety of features from the input scene. These transforms are either based on data, data and reconstructed features, or combinations of reconstructed features. At each stage, constraint satisfaction networks collect and fuse the evidence obtained through the parameter transforms. This process ensures a globally consistent interpretation of the input scene and allows for the integration of diverse types of information. The final interpretation of the scene is a small consistent subset of the many initial hypotheses about partial features, primitive features, feature assemblies, and 3D objects computed by the various parameter transforms. This paper reports on a complete, integrated (and implemented) system that extracts planar surfaces, patches of quadrics of revolution, and planar intersection curves of these surfaces (lines and conic sections in three space) from a depth map viewing 3-D objects. The reconstructed primitive features are used to index into an object model database to form hypotheses about objects in the scene. Integration of the various modules is a significant aspect of this work. Experimental results detailing the recognition behavior of the system are presented.
引用
收藏
页码:534 / 548
页数:15
相关论文
共 48 条
[1]   VISUAL SHAPE COMPUTATION [J].
ALOIMONOS, J .
PROCEEDINGS OF THE IEEE, 1988, 76 (08) :899-916
[2]  
ALOIMONOS J, 1987, 1ST P INT C COMP VIS, P35
[3]  
BAHNU B, 1984, IEEE T PATTERN ANAL, V8, P137
[4]  
BAIRD HS, 1985, MODEL BASED IMAGE MA
[5]  
BALLARD DH, 1981, 7TH P INT JOINT C AR, P1068
[6]  
Besl P. J., 1988, Second International Conference on Computer Vision (IEEE Cat. No.88CH2664-1), P591, DOI 10.1109/CCV.1988.590039
[7]  
Binford T. O., 1982, INT J ROBOT RES, V1, P18
[8]   ON OPTIMALLY COMBINING PIECES OF INFORMATION, WITH APPLICATION TO ESTIMATING 3-D COMPLEX-OBJECT POSITION FROM RANGE DATA [J].
BOLLE, RM ;
COOPER, DB .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (05) :619-638
[9]  
BOLLE RM, 1987, NOV P IEEE WORKSH CO, P324
[10]  
BOLLE RM, 1989, JUN P IEEE C COMP VI, P625