Seeking the strongest rigid detector

Cited by: 153
Authors
Benenson, Rodrigo [1]
Mathias, Markus [1]
Tuytelaars, Tinne [1]
Van Gool, Luc [1]
Affiliations
[1] Katholieke Univ Leuven, ESAT PSI VISICS IBBT, Louvain, Belgium
Source
2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) | 2013
DOI
10.1109/CVPR.2013.470
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The current state-of-the-art solutions for object detection describe each class by a set of models trained on discovered sub-classes (so-called "components"), with each model itself composed of collections of interrelated parts (deformable models). These detectors build upon the now classic Histogram of Oriented Gradients (HOG) + linear SVM combination. In this paper we revisit some of the core assumptions in HOG+SVM and show that, by properly designing the feature pooling, feature selection, preprocessing, and training methods, it is possible to reach top quality, at least for pedestrian detection, using a single rigid component. We provide experiments over a large design space, which give insight into the design of classifiers as well as relevant information for practitioners. Our best detector is fully feed-forward, has a single unified architecture, uses only histograms of oriented gradients and colour information in monocular static images, and improves over 23 other methods on the INRIA, ETH and Caltech-USA datasets, reducing the average miss-rate relative to HOG+SVM by more than 30%.
Pages: 3666-3673
Number of pages: 8