机构:
MIT, Artificial Intelligence Lab, Ctr Biol & Computat Learning, Cambridge, MA 02139 USAMIT, Artificial Intelligence Lab, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
Papageorgiou, CP
[1
]
Oren, M
论文数: 0引用数: 0
h-index: 0
机构:
MIT, Artificial Intelligence Lab, Ctr Biol & Computat Learning, Cambridge, MA 02139 USAMIT, Artificial Intelligence Lab, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
Oren, M
[1
]
Poggio, T
论文数: 0引用数: 0
h-index: 0
机构:
MIT, Artificial Intelligence Lab, Ctr Biol & Computat Learning, Cambridge, MA 02139 USAMIT, Artificial Intelligence Lab, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
Poggio, T
[1
]
机构:
[1] MIT, Artificial Intelligence Lab, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
来源:
SIXTH INTERNATIONAL CONFERENCE ON COMPUTER VISION
|
1998年
关键词:
D O I:
10.1109/ICCV.1998.710772
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
This paper presents a general trainable framework for object detection in static images of cluttered scenes. The detection technique we develop is based on a wavelet representation of an object class derived from a statistical analysis of the class instances. By learning an object class in terms of a subset of an overcomplete dictionary of wavelet basis functions, we derive a compact representation of an object class which is used as an input to a support vector machine classifier. This representation overcomes both the problem of in-class variability and provides a low false detection rate in unconstrained environments. We demonstrate the capabilities of the technique in two domains whose inherent information content differs significantly. The first system is face detection and the Second is the domain of people which, in contrast to faces, vary greatly in color, texture, and patterns. Unlike previous approaches, this system learns front examples and does not rely on any a priori (handcrafted) models or motion-based segmentation. The paper also presents a motion-based extension to enhance the performance of the detection algorithm over video sequences. The results presented here suggest that this architecture may well be quite general.