Visual Object Detection with Deformable Part Models

被引:22
作者
Felzenszwalb, Pedro [1 ,2 ]
Girshick, Ross [3 ]
McAllester, David [4 ]
Ramanan, Deva [5 ]
机构
[1] Brown Univ, Sch Engn, Providence, RI 02912 USA
[2] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
[3] Univ Calif Berkeley, EECS, Berkeley, CA USA
[4] Toyota Technol Inst, Chicago, IL USA
[5] UC Irvine, Dept Comp Sci, Irvine, CA USA
基金
美国国家科学基金会;
关键词
RECOGNITION;
D O I
10.1145/2500468.2494532
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We describe a state-of-the-art system for finding objects in cluttered images. Our system is based on deformable models that represent objects using local part templates and geometric constraints on the locations of parts. We reduce object detection to classification with latent variables. The latent variables introduce invariances that make it possible to detect objects with highly variable appearance. We use a generalization of support vector machines to incorporate latent information during training. This has led to a general framework for discriminative training of classifiers with latent variables. Discriminative training benefits from large training datasets. In practice we use an iterative algorithm that alternates between estimating latent values for positive examples and solving a large convex optimization problem. Practical optimization of this large convex problem can be done using active set techniques for adaptive subsampling of the training data.
引用
收藏
页码:97 / 105
页数:9
相关论文
共 34 条
[1]   POP:: Patchwork of parts models for object recognition [J].
Amit, Yali ;
Trouve, Alain .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 75 (02) :267-282
[2]  
[Anonymous], 1991, Hands: A Pattern Theoretic Study of Biological Shapes
[3]  
[Anonymous], IEEE C COMP VIS PATT
[4]  
[Anonymous], 2003, Advances in Neural Information Processing Systems
[5]  
Burl M. C., 1998, Computer Vision - ECCV'98. 5th European Conference on Computer Vision. Proceedings, P628, DOI 10.1007/BFb0054769
[6]   Active appearance models [J].
Cootes, TF ;
Edwards, GJ ;
Taylor, CJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (06) :681-685
[7]   Efficient deformable template detection and localization without user initialization [J].
Coughlan, J ;
Yuille, A ;
English, C ;
Snow, D .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2000, 78 (03) :303-319
[8]  
Crandall D., 2005, IEEE C COMP VIS PATT
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]  
Dalal N., 2005, IEEE C COMP VIS PATT