Seeking the strongest rigid detector

Cited by: 153

Authors:

Benenson, Rodrigo ^{[1]}

Mathias, Markus ^{[1]}

Tuytelaars, Tinne ^{[1]}

Van Gool, Luc ^{[1]}

Affiliations:

[1] Katholieke Univ Leuven, ESAT PSI VISICS IBBT, Louvain, Belgium

Source:

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) | 2013

DOI:

10.1109/CVPR.2013.470

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The current state-of-the-art solutions for object detection describe each class by a set of models trained on discovered sub-classes (so-called "components"), with each model itself composed of collections of interrelated parts (deformable models). These detectors build upon the now classic Histogram of Oriented Gradients + linear SVM combo. In this paper we revisit some of the core assumptions in HOG+SVM and show that by properly designing the feature pooling, feature selection, preprocessing, and training methods, it is possible to reach top quality, at least for pedestrian detection, using a single rigid component. We provide experiments for a large design space that give insights into the design of classifiers, as well as relevant information for practitioners. Our best detector is fully feed-forward, has a single unified architecture, uses only histograms of oriented gradients and colour information in monocular static images, and improves over 23 other methods on the INRIA, ETH and Caltech-USA datasets, reducing the average miss-rate over HOG+SVM by more than 30%.

Pages: 3666-3673

Page count: 8