Rapid object detection using a boosted cascade of simple features

Cited by: 8777
Authors
Viola, P [1]
Jones, M [1]
Affiliation
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
Source
2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 1, Proceedings | 2001
Keywords
DOI
10.1109/cvpr.2001.990517
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes a machine learning approach for visual object detection which is capable of processing images extremely rapidly and achieving high detection rates. This work is distinguished by three key contributions. The first is the introduction of a new image representation called the "Integral Image" which allows the features used by our detector to be computed very quickly. The second is a learning algorithm, based on AdaBoost, which selects a small number of critical visual features from a larger set and yields extremely efficient classifiers [5]. The third contribution is a method for combining increasingly more complex classifiers in a "cascade" which allows background regions of the image to be quickly discarded while spending more computation on promising object-like regions. The cascade can be viewed as an object-specific focus-of-attention mechanism which, unlike previous approaches, provides statistical guarantees that discarded regions are unlikely to contain the object of interest. In the domain of face detection the system yields detection rates comparable to the best previous systems. Used in real-time applications, the detector runs at 15 frames per second without resorting to image differencing or skin color detection.
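As a rough illustration of the first and third contributions named in the abstract, the sketch below builds an integral image in plain Python, uses it to sum any rectangle with four lookups, and evaluates a cascade that rejects a window at the first failing stage. It is a minimal sketch, not the authors' implementation: the function names (`integral_image`, `rect_sum`, `two_rect_feature`, `cascade_accepts`), the zero-padded table layout, and the representation of a stage as a `(score_fn, threshold)` pair are all assumptions made for illustration, and the AdaBoost training that would select features and thresholds is omitted.

```python
def integral_image(img):
    """Return a (h+1) x (w+1) table where ii[y][x] is the sum of img[0:y][0:x].

    The extra zero row and column (an assumption of this sketch) avoid
    boundary checks when summing rectangles.
    """
    h = len(img)
    w = len(img[0]) if h else 0
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0  # running sum of the current row up to column x
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the pixels in the w-by-h rectangle with top-left corner (x, y).

    Uses four table lookups regardless of the rectangle size.
    """
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_feature(ii, x, y, w, h):
    """Left half minus right half of a w-by-h window (w assumed even)."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)


def cascade_accepts(stages, ii, x, y):
    """Evaluate hypothetical stages in order; reject at the first failure.

    Each stage is a (score_fn, threshold) pair. Cheap stages placed first
    let most background windows exit early.
    """
    for score_fn, threshold in stages:
        if score_fn(ii, x, y) < threshold:
            return False  # discarded without running later stages
    return True


# Usage on a tiny 3x3 "image".
img = [[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]]
ii = integral_image(img)
total = rect_sum(ii, 0, 0, 3, 3)            # sum of all nine pixels
feature = two_rect_feature(ii, 0, 0, 2, 3)  # left column minus middle column

# Two hand-picked stages; the thresholds are illustrative only.
stages = [
    (lambda ii, x, y: two_rect_feature(ii, x, y, 2, 2), -5),
    (lambda ii, x, y: rect_sum(ii, x, y, 2, 2), 20),
]
```

With this input, `total` is 45 and `feature` is -3. The 2x2 window at (0, 0) passes the first stage but is rejected by the second, while the window at (1, 1) passes both.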
Pages: 511-518
Page count: 8
Related papers
17 in total
  • [1] AMIT Y, 1997, JOINT INDUCTION SHAP
  • [2] [Anonymous], 1997, P IEEE C COMP VIS PA
  • [3] [Anonymous], 1998, INT C COMP VIS
  • [4] Crow F. C., 1984, Computer Graphics, V18, P207
  • [5] FLEURET F, 2001, INT J COMPUTER VISIO
  • [6] Freeman W. T., Adelson E. H., 1991, The design and use of steerable filters, IEEE Transactions on Pattern Analysis and Machine Intelligence, V13(9), P891-906
  • [7] Freund Y., 1995, Computational Learning Theory. Second European Conference, EuroCOLT '95. Proceedings, P23
  • [8] GREENSPAN H, 1994, P IEEE C COMP VIS PA
  • [9] Itti L., Koch C., Niebur E., 1998, A model of saliency-based visual attention for rapid scene analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence, V20(11), P1254-1259
  • [10] Rowley H. A., Baluja S., Kanade T., 1998, Neural network-based face detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, V20(1), P23-38