Towards real-time 3-D monocular visual tracking of human limbs in unconstrained environments

Cited by: 10
Authors
Bullock, D
Zelek, J [1]
Affiliations
[1] Univ Waterloo, Waterloo, ON N2L 3G1, Canada
[2] Univ Guelph, Sch Engn, Guelph, ON N1G 2W1, Canada
DOI
10.1016/j.rti.2005.06.004
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The 3-D visual tracking of human limbs is fundamental to a wide array of computer vision applications, including gesture recognition, interactive entertainment, biomechanical analysis, vehicle driver monitoring, and electronic surveillance. The problem of limb tracking is complicated by occlusion, depth ambiguities, rotational ambiguities, and high levels of noise caused by loose-fitting clothing. We attempt to solve the 3-D limb tracking problem using only monocular imagery (a single 2-D video source) in largely unconstrained environments. The approach presented is a step towards full real-time operation. The system described is a complete visual tracking system that incorporates target detection, target model acquisition/initialization, and target tracking components into a single, cohesive, probabilistic framework. The presence of a target is detected, using visual cues alone, by recognizing an individual performing a simple pre-defined initialization cue. The physical dimensions of the limb are then learned probabilistically until a statistically stable model estimate has been found. The appearance of the limb is learned in a joint spatial-chromatic domain, which combines normalized color data with spatial constraints in order to model complex target appearances. Target tracking is performed within a Monte Carlo particle filtering framework, which is capable of maintaining multiple state-space hypotheses and propagating ambiguity until less ambiguous data is observed. Multiple image cues are combined within this framework in a principled Bayesian manner. The target detection and model acquisition components perform at near real-time frame rates and are shown to accurately recognize the presence of a target and initialize a target model specific to that user. The target tracking component has demonstrated exceptional resilience to occlusion and temporary target disappearance, and contains a natural mechanism for trading off accuracy against speed. At this point, the target tracking component performs at sub-real-time frame rates, although several methods to increase the effective operating speed are proposed. © 2005 Elsevier Ltd. All rights reserved.
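The particle filtering idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a scalar state (a single hypothetical joint angle in radians), a random-walk motion model, Gaussian cue likelihoods, and conditionally independent cues fused by multiplying their likelihoods. All function names and parameters are invented for illustration. A cue whose measurement is None is treated as unavailable for that frame (for example, under occlusion), so the hypotheses simply diffuse until data returns.

```python
import math
import random


def gaussian_likelihood(residual, sigma):
    """Unnormalized Gaussian likelihood of a measurement residual."""
    return math.exp(-0.5 * (residual / sigma) ** 2)


def predict(particles, motion_sigma, rng):
    """Diffuse each hypothesis with a random-walk motion model (an assumption)."""
    return [p + rng.gauss(0.0, motion_sigma) for p in particles]


def update_weights(particles, cues):
    """Fuse cues by multiplying likelihoods (assumes conditional independence).

    `cues` is a list of (measurement, sigma) pairs; a measurement of None
    marks a cue that is unavailable in this frame, e.g. under occlusion.
    """
    weights = []
    for p in particles:
        w = 1.0
        for z, sigma in cues:
            if z is not None:
                w *= gaussian_likelihood(z - p, sigma)
        weights.append(w)
    total = sum(weights)
    if total == 0.0:
        # Every hypothesis was rejected numerically; fall back to uniform.
        return [1.0 / len(particles)] * len(particles)
    return [w / total for w in weights]


def resample(particles, weights, rng):
    """Systematic resampling: draw N particles in proportion to their weights."""
    n = len(particles)
    step = 1.0 / n
    u = rng.random() * step
    out, cumulative, i = [], weights[0], 0
    for _ in range(n):
        while u > cumulative and i < n - 1:
            i += 1
            cumulative += weights[i]
        out.append(particles[i])
        u += step
    return out


def track(frames, n_particles=500, motion_sigma=0.05, seed=0):
    """Return one weighted-mean state estimate per frame."""
    rng = random.Random(seed)
    particles = [rng.uniform(-math.pi, math.pi) for _ in range(n_particles)]
    estimates = []
    for cues in frames:
        particles = predict(particles, motion_sigma, rng)
        weights = update_weights(particles, cues)
        estimates.append(sum(p * w for p, w in zip(particles, weights)))
        particles = resample(particles, weights, rng)
    return estimates
```

In a sketch like this, `n_particles` is one obvious knob for trading accuracy against speed: more particles cover the state space more densely at a proportional cost per frame. Whether this is the mechanism the abstract refers to is an assumption.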
Pages: 323-353
Page count: 31