Dynamic Template Tracking and Recognition

被引：11

作者：

Chaudhry, Rizwan ^{[1
]}

Hager, Gregory ^{[2
]}

Vidal, Rene ^{[1
]}

机构：

[1] Johns Hopkins Univ, Ctr Imaging Sci, Baltimore, MD 21218 USA

[2] Johns Hopkins Univ, Baltimore, MD USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2013年 / 105卷 / 01期

基金：

欧洲研究理事会;

关键词：

Dynamic templates; Dynamic textures; Human actions; Tracking; Linear dynamical systems; Recognition; CLASSIFICATION;

D O I：

10.1007/s11263-013-0625-0

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we address the problem of tracking non-rigid objects whose local appearance and motion changes as a function of time. This class of objects includes dynamic textures such as steam, fire, smoke, water, etc., as well as articulated objects such as humans performing various actions. We model the temporal evolution of the object's appearance/motion using a linear dynamical system. We learn such models from sample videos and use them as dynamic templates for tracking objects in novel videos. We pose the problem of tracking a dynamic non-rigid object in the current frame as a maximum a-posteriori estimate of the location of the object and the latent state of the dynamical system, given the current image features and the best estimate of the state in the previous frame. The advantage of our approach is that we can specify a-priori the type of texture to be tracked in the scene by using previously trained models for the dynamics of these textures. Our framework naturally generalizes common tracking methods such as SSD and kernel-based tracking from static templates to dynamic templates. We test our algorithm on synthetic as well as real examples of dynamic textures and show that our simple dynamics-based trackers perform at par if not better than the state-of-the-art. Since our approach is general and applicable to any image feature, we also apply it to the problem of human action tracking and build action-specific optical flow trackers that perform better than the state-of-the-art when tracking a human performing a particular action. Finally, since our approach is generative, we can use a-priori trained trackers for different texture or action classes to simultaneously track and recognize the texture or action in the video.

引用

页码：19 / 48

页数：30

共 43 条

[1] Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning [J].

Ali, Saad ;

Shah, Mubarak .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) :288-303

[2]

[Anonymous], BRIT MACH VIS C

[3]

[Anonymous], 2009, IEEE C COMP VIS PATT

[4]

[Anonymous], 2008, CVPR

[5]

[Anonymous], VISION APPL SPECIAL

[6]

[Anonymous], IEEE C COMP VIS PATT

[7]

[Anonymous], 2005, IEEE INT WORKSH PERF

[8]

[Anonymous], 0901 J HOPK U DEP CO

[9]

[Anonymous], IEEE C COMP VIS PATT

[10]

[Anonymous], 2009, IEEE C COMP VIS PATT

← 1 2 3 4 5 →