Pedestrian Detection: An Evaluation of the State of the Art

被引:2130
作者
Dollar, Piotr [1 ]
Wojek, Christian [2 ]
Schiele, Bernt [2 ]
Perona, Pietro [1 ]
机构
[1] CALTECH, Dept Elect Engn, Pasadena, CA 91125 USA
[2] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
Pedestrian detection; object detection; benchmark; evaluation; data set; Caltech Pedestrian data set; STATISTICAL COMPARISONS; OBJECT DETECTION; CLASSIFICATION; CLASSIFIERS; FEATURES; TRACKING; SYSTEM; SCALE;
D O I
10.1109/TPAMI.2011.155
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian detection is a key problem in computer vision, with several applications that have the potential to positively impact quality of life. In recent years, the number of approaches to detecting pedestrians in monocular images has grown steadily. However, multiple data sets and widely varying evaluation protocols are used, making direct comparisons difficult. To address these shortcomings, we perform an extensive evaluation of the state of the art in a unified framework. We make three primary contributions: 1) We put together a large, well-annotated, and realistic monocular pedestrian detection data set and study the statistics of the size, position, and occlusion patterns of pedestrians in urban scenes, 2) we propose a refined per-frame evaluation methodology that allows us to carry out probing and informative comparisons, including measuring performance in relation to scale and occlusion, and 3) we evaluate the performance of sixteen pretrained state-of-the-art detectors across six data sets. Our study allows us to assess the state of the art and provides a framework for gauging future efforts. Our experiments show that despite significant progress, performance still has much room for improvement. In particular, detection is disappointing at low resolutions and for partially occluded pedestrians.
引用
收藏
页码:743 / 761
页数:19
相关论文
共 88 条
  • [41] García S, 2008, J MACH LEARN RES, V9, P2677
  • [42] Multi-cue pedestrian detection and tracking from a moving vehicle
    Gavrila, D. M.
    Munder, S.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 73 (01) : 41 - 59
  • [43] Gavrila D. M., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P87, DOI 10.1109/ICCV.1999.791202
  • [44] A Bayesian, exemplar-based approach to hierarchical shape matching
    Gavrila, Dariu M.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (08) : 1408 - 1421
  • [45] Geronimo D., 2005, P INT C COMP VIS SYS
  • [46] Survey of Pedestrian Detection for Advanced Driver Assistance Systems
    Geronimo, David
    Lopez, Antonio M.
    Sappa, Angel D.
    Graf, Thorsten
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (07) : 1239 - 1258
  • [47] Griffin G., 2007, Caltech-256 object category dataset
  • [48] TEXTURAL FEATURES FOR IMAGE CLASSIFICATION
    HARALICK, RM
    SHANMUGAM, K
    DINSTEIN, I
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1973, SMC3 (06): : 610 - 621
  • [49] Hussain S., 2010, P BRIT MACH VIS C
  • [50] A Comprehensive Evaluation Framework and a Comparative Study for Human Detectors
    Hussein, Mohamed
    Porikli, Fatih
    Davis, Larry
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2009, 10 (03) : 417 - 427