Robust Visual Tracking via Structured Multi-Task Sparse Learning

被引：314

作者：

Zhang, Tianzhu ^{[1
]}

Ghanem, Bernard ^{[2
]}

Liu, Si ^{[3
]}

Ahuja, Narendra ^{[4
,5
]}

机构：

[1] ADSC, Singapore 138632, Singapore

[2] KAUST, Thuwal, Saudi Arabia

[3] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore

[4] Univ Illinois, Dept Elect & Comp Engn, Beckman Inst, Beckman Inst 2041, Urbana, IL 61801 USA

[5] Univ Illinois, Coordinated Sci Lab, Beckman Inst 2041, Urbana, IL 61801 USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2013年 / 101卷 / 02期

关键词：

Visual tracking; Particle filter; Graph; Structure; Sparse representation; Multi-task learning; OBJECT TRACKING; MODELS;

D O I：

10.1007/s11263-012-0582-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we formulate object tracking in a particle filter framework as a structured multi-task sparse learning problem, which we denote as Structured Multi-Task Tracking (S-MTT). Since we model particles as linear combinations of dictionary templates that are updated dynamically, learning the representation of each particle is considered a single task in Multi-Task Tracking (MTT). By employing popular sparsity-inducing mixed norms and we regularize the representation problem to enforce joint sparsity and learn the particle representations together. As compared to previous methods that handle particles independently, our results demonstrate that mining the interdependencies between particles improves tracking performance and overall computational complexity. Interestingly, we show that the popular tracker (Mei and Ling, IEEE Trans Pattern Anal Mach Intel 33(11):2259-2272, 2011) is a special case of our MTT formulation (denoted as the tracker) when Under the MTT framework, some of the tasks (particle representations) are often more closely related and more likely to share common relevant covariates than other tasks. Therefore, we extend the MTT framework to take into account pairwise structural correlations between particles (e.g. spatial smoothness of representation) and denote the novel framework as S-MTT. The problem of learning the regularized sparse representation in MTT and S-MTT can be solved efficiently using an Accelerated Proximal Gradient (APG) method that yields a sequence of closed form updates. As such, S-MTT and MTT are computationally attractive. We test our proposed approach on challenging sequences involving heavy occlusion, drastic illumination changes, and large pose variations. Experimental results show that S-MTT is much better than MTT, and both methods consistently outperform state-of-the-art trackers.

引用

页码：367 / 383

页数：17

共 42 条

[1] Adam A., 2006, IEEE C COMPUTER VISI, V1, P798, DOI [DOI 10.1109/CVPR.2006.256, 10.1109/CVPR.2006.256]
[2] [Anonymous], 2012, P 2012 WINT SIM C WS, DOI 10.1109/WSC.2012.6465056
[3] [Anonymous], 2001, Sequential Monte Carlo methods in practice
[4] [Anonymous], 2006, BMVC06
[5] [Anonymous], 2008, SIAM J OPTIMIZ
[6] [Anonymous], ECCV
[7] [Anonymous], IEEE C COMP VIS PATT
[8] [Anonymous], 2010, 2010 IEEE INT WORKSH
[9] Avidan S, 2005, PROC CVPR IEEE, P494
[10] Babenko B, 2009, PROC CVPR IEEE, P983, DOI 10.1109/CVPRW.2009.5206737

← 1 2 3 4 5 →