Robust visual tracking by integrating multiple cues based on co-inference learning

被引:112
作者
Wu, Y
Huang, TS
机构
[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
[2] Univ Illinois, Beckman Inst, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
visual tracking; sequential Monte Carlo; importance sampling; co-inference; factorized graphical model; variational analysis;
D O I
10.1023/B:VISI.0000016147.97880.cd
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual tracking can be treated as a parameter estimation problem that infers target states based on image observations from video sequences. A richer target representation may incur better chances of successful tracking in cluttered and dynamic environments, and thus enhance the robustness. Richer representations can be constructed by either specifying a detailed model of a single cue or combining a set of rough models of multiple cues. Both approaches increase the dimensionality of the state space, which results in a dramatic increase of computation. To investigate the integration of rough models from multiple cues and to explore computationally efficient algorithms, this paper formulates the problem of multiple cue integration and tracking in a probabilistic framework based on a factorized graphical model. Structured variational analysis of such a graphical model factorizes different modalities and suggests a co-inference process among these modalities. Based on the importance sampling technique, a sequential Monte Carlo algorithm is proposed to provide an efficient simulation and approximation of the co-inferencing of multiple cues. This algorithm runs in real-time at around 30 Hz. Our extensive experiments show that the proposed algorithm performs robustly in a large variety of tracking scenarios. The approach presented in this paper has the potential to solve other problems including sensor fusion problems.
引用
收藏
页码:55 / 71
页数:17
相关论文
共 41 条
[31]   COLOR INDEXING [J].
SWAIN, MJ ;
BALLARD, DH .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1991, 7 (01) :11-32
[32]  
Tanner M.A., 1993, TOOLS STAT INFERENCE, V2nd
[33]  
Tao H, 2000, PROC CVPR IEEE, P134, DOI 10.1109/CVPR.2000.854760
[34]  
TAO H, 1999, P ICCV 99 WORKSH VIS
[35]   Incremental focus of attention for robust visual tracking [J].
Toyama, K ;
Hager, GD .
1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, :189-195
[36]  
TOYAMA K, 2000, P EUR C COMP VIS IRL
[37]  
TOYAMA K, 1999, INT C COMP VIS, P255, DOI DOI 10.1109/ICCV.1999.791228
[38]   Pfinder: Real-time tracking of the human body [J].
Wren, CR ;
Azarbayejani, A ;
Darrell, T ;
Pentland, AP .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :780-785
[39]  
Wu Y, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, P26, DOI 10.1109/ICCV.2001.937590
[40]  
Wu Y, 2001, IEEE SIGNAL PROC MAG, V18, P51