Genetic object recognition using combinations of views

被引:26
作者
Bebis, G [1 ]
Louis, S
Varol, Y
Yfantis, A
机构
[1] Univ Nevada, Dept Comp Sci, Reno, NV 89557 USA
[2] Univ Nevada, Dept Comp Sci, Las Vegas, NV 89154 USA
基金
美国国家科学基金会;
关键词
algebraic functions of views; genetic algorithms; object recognition;
D O I
10.1109/4235.996013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate the application of genetic algorithms (GAs) for recognizing real two-dimensional (2-D) or three-dimensional (3-D) objects from 2-D intensity images, assuming that the viewpoint is arbitrary. Our approach is model-based (i.e., we assume a predefined set of models), while our recognition strategy lies on the recently proposed theory of algebraic functions of views. According to this theory, the variety of 2-D views depicting an object can be expressed as a combination of a small number of 2-D views of the object. This implies a simple and powerful strategy for object recognition: novel 2-D views of an object (2-D or 3-D) can be recognized by simply matching them to combinations of known 2-D views of the object. In other words, objects in a scene are recognized by "predicting" their appearance through the combination of known views of the objects. This is an important idea, which is also supported by psychophysical findings indicating that the human visual system works in a similar way. The main difficulty in implementing this idea is determining the parameters of the combination of views. This problem can be solved either in the space of feature matches among the views ("image space") or the space of parameters ("transformation space"). In general, both of these spaces are very large, making the search very time consuming. In this paper, we propose using GAs to search these spaces efficiently. To improve the efficiency of genetic search in the transformation space, we use singular value decomposition and interval arithmetic to restrict genetic search in the most feasible regions of the transformation space. The effectiveness of the GA approaches is shown on a set of increasingly complex real scenes where exact and near-exact matches are found reliably and quickly.
引用
收藏
页码:132 / 146
页数:15
相关论文
共 39 条
[1]  
[Anonymous], 1989, GENETIC ALGORITHM SE
[2]  
[Anonymous], P EUR C COMP VIS
[3]  
[Anonymous], SIAM J NUMERICAL A 2
[4]  
ANSARI N, 1992, DYNAMIC GENETIC CHAO, P353
[5]   GENERALIZING THE HOUGH TRANSFORM TO DETECT ARBITRARY SHAPES [J].
BALLARD, DH .
PATTERN RECOGNITION, 1981, 13 (02) :111-122
[6]  
Basri R., 1994, Applications of Invariance in Computer Vision. Second Joint European - US Workshop Proceedings, P435
[7]  
BASRI R, 1993, CVGIP-IMAG UNDERSTAN, V57, P331, DOI 10.1006/ciun.1993.1022
[8]  
BASRI R, PARAPERSPECTIVE AFFI
[9]  
Bebis G., 1999, Proceedings 1999 International Conference on Information Intelligence and Systems (Cat. No.PR00446), P360, DOI 10.1109/ICIIS.1999.810289
[10]   Indexing based on algebraic functions of views [J].
Bebis, G ;
Georgiopoulos, M ;
Shah, M ;
Lobo, ND .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 72 (03) :360-378