Simultaneous object recognition and segmentation from single or multiple model views

被引:99
作者
Ferrari, Vittorio [1 ]
Tuytelaars, Tinne
Van Gool, Luc
机构
[1] Swiss Fed Inst Technol, BIWI, Comp Vis Grp, Zurich, Switzerland
[2] Univ Louvain, ESAT, PSI, Louvain, Belgium
关键词
Extractor; Object Recognition; Invariant Region; Region Detector; Correct Match;
D O I
10.1007/s11263-005-3964-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel Object Recognition approach based on affine invariant regions. It actively counters the problems related to the limited repeatability of the region detectors, and the difficulty of matching, in the presence of large amounts of background clutter and particularly challenging viewing conditions. After producing an initial set of matches, the method gradually explores the surrounding image areas, recursively constructing more and more matching regions, increasingly farther from the initial ones. This process covers the object with matches, and simultaneously separates the correct matches from the wrong ones. Hence, recognition and segmentation are achieved at the same time. The approach includes a mechanism for capturing the relationships between multiple model views and exploiting these for integrating the contributions of the views at recognition time. This is based on an efficient algorithm for partitioning a set of region matches into groups lying on smooth surfaces. Integration is achieved by measuring the consistency of configurations of groups arising from different model views. Experimental results demonstrate the stronger power of the approach in dealing with extensive clutter, dominant occlusion, and large scale and viewpoint changes. Non-rigid deformations are explicitly taken into account, and the approximative contours of the object are produced. All presented techniques can extend any view-point invariant feature extractor.
引用
收藏
页码:159 / 188
页数:30
相关论文
共 36 条
[1]  
[Anonymous], BMVC
[2]  
Baumberg A, 2000, PROC CVPR IEEE, P774, DOI 10.1109/CVPR.2000.855899
[3]  
BEBIS G, 1995, ICCV, P543
[4]  
CHUM O, 2003, COMP VIS WINT WORKSH
[5]  
Cyr CM, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, P254, DOI 10.1109/ICCV.2001.937526
[6]  
FERRARI V, 2003, CVPR, V1, P718
[7]  
FERRARI V, 2004, ECCV, V1, P40
[8]  
FERRARI V, 2004, CVPR
[9]  
FERRARI V, 2004, THESI SELECTED READI
[10]  
Kolmogorov V, 2002, LECT NOTES COMPUT SC, V2352, P65