Pedestrian Detection: An Evaluation of the State of the Art

被引:2130
作者
Dollar, Piotr [1 ]
Wojek, Christian [2 ]
Schiele, Bernt [2 ]
Perona, Pietro [1 ]
机构
[1] CALTECH, Dept Elect Engn, Pasadena, CA 91125 USA
[2] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
Pedestrian detection; object detection; benchmark; evaluation; data set; Caltech Pedestrian data set; STATISTICAL COMPARISONS; OBJECT DETECTION; CLASSIFICATION; CLASSIFIERS; FEATURES; TRACKING; SYSTEM; SCALE;
D O I
10.1109/TPAMI.2011.155
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian detection is a key problem in computer vision, with several applications that have the potential to positively impact quality of life. In recent years, the number of approaches to detecting pedestrians in monocular images has grown steadily. However, multiple data sets and widely varying evaluation protocols are used, making direct comparisons difficult. To address these shortcomings, we perform an extensive evaluation of the state of the art in a unified framework. We make three primary contributions: 1) We put together a large, well-annotated, and realistic monocular pedestrian detection data set and study the statistics of the size, position, and occlusion patterns of pedestrians in urban scenes, 2) we propose a refined per-frame evaluation methodology that allows us to carry out probing and informative comparisons, including measuring performance in relation to scale and occlusion, and 3) we evaluate the performance of sixteen pretrained state-of-the-art detectors across six data sets. Our study allows us to assess the state of the art and provides a framework for gauging future efforts. Our experiments show that despite significant progress, performance still has much room for improvement. In particular, detection is disappointing at low resolutions and for partially occluded pedestrians.
引用
收藏
页码:743 / 761
页数:19
相关论文
共 88 条
  • [81] Robust real-time face detection
    Viola, P
    Jones, MJ
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 57 (02) : 137 - 154
  • [82] Walk S., 2010, P EUR C COMP VIS
  • [83] Walk S., 2010, CVPR
  • [84] An HOG-LBP Human Detector with Partial Occlusion Handling
    Wang, Xiaoyu
    Han, Tony X.
    Yan, Shuicheng
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 32 - 39
  • [85] Weber M., 2000, P EUR C COMP VIS
  • [86] Wojek C, 2009, PROC CVPR IEEE, P794, DOI 10.1109/CVPRW.2009.5206638
  • [87] Wu B., 2005, P 10 IEEE INT C COMP
  • [88] Zhang W., 2007, P IEEE INT C COMP VI