Affine-similar shape retrieval: Application to multiview 3-D object recognition

被引：27

作者：

Abbasi, S ^{[1
]}

Mokhtarian, F ^{[1
]}

机构：

[1] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 5XH, Surrey, England

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2001年 / 10卷 / 01期

基金：

英国工程与自然科学研究理事会;

关键词：

affine transform; curvature scale space; maxima of CSS contours; multiview object recognition; shape similarity retrieval;

D O I：

10.1109/83.892449

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The feasibility of representing a three-dimensional (3-D) object with a small number of standard views is studied in this paper. The object boundary of each view is considered as a two-dimensional (2-D) shape and is represented by the locations of the maxima of its curvature scale space (CSS) image contours, The idea is to identify an unknown object from an image taken from a random view by using the stored descriptions of the standard views. The CSS image has been selected for MPEG-7 standardization. The maxima of CSS image have already been used to represent 2-D shapes in different applications under similarity transforms. Since the new application involves affine transforms, we first examine the effects of general affine transforms on the representation and show that the locations of the maxima of the CSS image do not move dramatically even under large affine transformations. Our system for shape-based retrieval from large image databases is then applied to multiview 3-D object representation and recognition. Our collection of 3-D objects consists of 18 aircrafts of different shapes. Three silhouette contours corresponding to random, views are separately used as input for each object. Results indicate that robust and efficient 3-D free-form object recognition through multiview representation can be achieved using the CSS representation.

引用

页码：131 / 139

页数：9

共 26 条

[1] Curvature scale space image in shape similarity retrieval [J].

Abbasi, S ;

Mokhtarian, F ;

Kittler, J .

MULTIMEDIA SYSTEMS, 1999, 7 (06) :467-476

[2]

ABBASI S, 2000, P 15 INT C PATT REC

[3] USING DYNAMIC-PROGRAMMING FOR SOLVING VARIATIONAL-PROBLEMS IN VISION [J].

AMINI, AA ;

WEYMOUTH, TE ;

JAIN, RC .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (09) :855-867

[4] APPLICATION OF AFFINE-INVARIANT FOURIER DESCRIPTORS TO RECOGNITION OF 3-D OBJECTS [J].

ARBTER, K ;

SNYDER, WE ;

BURKHARDT, H ;

HIRZINGER, G .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (07) :640-647

[5] Fast-learning VIEWNET architectures for recognizing three-dimensional objects from multiple two-dimensional views [J].

Bradski, G ;

Grossberg, S .

NEURAL NETWORKS, 1995, 8 (7-8) :1053-1080

[6] SYMBOLIC REASONING AMONG 3-D MODELS AND 2-D IMAGES [J].

BROOKS, RA .

ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :285-348

[7]

COHIGNAC T, 1994, P INT C PATT REC, V1, P164

[8]

Cyganski D., 1987, Proceedings of the First International Conference on Computer Vision (Cat. No.87CH2465-3), P496

[9]

CYGANSKI D, 1991, P SOC PHOTO-OPT INS, V1607, P98

[10]

DUNKER J, 1996, P 13 INT C PATT REC, V4, P4

← 1 2 3 →