An experimental study on pedestrian classification

Cited by: 373
Authors
Munder, S.
Gavrila, D. M.
Affiliations
[1] DaimlerChrysler Res & Dev, Machine Percept Dept, D-89081 Ulm, Germany
[2] Univ Amsterdam, Fac Sci, Intelligent Syst Lab, NL-1098 SJ Amsterdam, Netherlands
Keywords
pedestrian classification; feature evaluation; classifier evaluation; performance analysis
DOI
10.1109/TPAMI.2006.217
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Detecting people in images is key for several important application domains in computer vision. This paper presents an in-depth experimental study on pedestrian classification; multiple feature-classifier combinations are examined with respect to their ROC performance and efficiency. We investigate global versus local and adaptive versus nonadaptive features, as exemplified by PCA coefficients, Haar wavelets, and local receptive fields (LRFs). In terms of classifiers, we consider the popular Support Vector Machines (SVMs), feed-forward neural networks, and k-nearest neighbor classifier. Experiments are performed on a large data set consisting of 4,000 pedestrian and more than 25,000 nonpedestrian (labeled) images captured in outdoor urban environments. Statistically meaningful results are obtained by analyzing performance variances caused by varying training and test sets. Furthermore, we investigate how classification performance and training sample size are correlated. Sample size is adjusted by increasing the number of manually labeled training data or by employing automatic bootstrapping or cascade techniques. Our experiments show that the novel combination of SVMs with LRF features performs best. A boosted cascade of Haar wavelets can, however, reach quite competitive results, at a fraction of computational cost. The data set used in this paper is made public, establishing a benchmark for this important problem.
Pages: 1863-1868
Page count: 6