A generative framework for real time object detection and classification

Cited by: 95
Authors
Fasel, I [1]
Fortenberry, B
Movellan, J
Affiliations
[1] Univ Calif San Diego, Inst Neural Computat, San Diego, CA 92103 USA
[2] Univ Calif San Diego, Dept Cognit Sci, San Diego, CA 92103 USA
Funding
National Science Foundation (USA);
Keywords
blink detection; eye detection; boosting; generative models;
DOI
10.1016/j.cviu.2004.07.014
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We formulate a probabilistic model of image generation and derive optimal inference algorithms for finding objects and object features within this framework. The approach models images as a collage of patches of arbitrary size, some of which contain the object of interest and some of which are background. The approach requires development of likelihood-ratio models for object versus background generated patches. These models are learned using boosting methods. One advantage of the generative approach proposed here is that it makes explicit the conditions under which it is optimal. We applied the approach to the problem of finding faces and eyes on arbitrary images. Optimal inference under the proposed model works in real time and is robust to changes in lighting, illumination, and differences in facial structure, including facial expressions and eyeglasses. Furthermore, the system can simultaneously track the eyes and blinks of multiple individuals. Finally we reflect on how the development of perceptive systems like this may help advance our understanding of the human brain. (C) 2004 Elsevier Inc. All rights reserved.
Pages: 182-210
Number of pages: 29