Cross-Domain Human Action Recognition

被引:69
作者
Bian, Wei [1 ]
Tao, Dacheng [1 ]
Rui, Yong [2 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW 2007, Australia
[2] Microsoft China R&D CRD Grp, Beijing 100190, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2012年 / 42卷 / 02期
关键词
Bag-of-words; cross-domain learning; human action recognition; topic models;
D O I
10.1109/TSMCB.2011.2166761
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Conventional human action recognition algorithms cannot work well when the amount of training videos is insufficient. We solve this problem by proposing a transfer topic model (TTM), which utilizes information extracted from videos in the auxiliary domain to assist recognition tasks in the target domain. The TTM is well characterized by two aspects: 1) it uses the bag-of-words model trained from the auxiliary domain to represent videos in the target domain; and 2) it assumes each human action is a mixture of a set of topics and uses the topics learned from the auxiliary domain to regularize the topic estimation in the target domain, wherein the regularization is the summation of Kullback-Leibler divergences between topic pairs of the two domains. The utilization of the auxiliary domain knowledge improves the generalization ability of the learned topic model. Experiments on Weizmann and KTH human action databases suggest the effectiveness of the proposed TTM for cross-domain human action recognition.
引用
收藏
页码:298 / 307
页数:10
相关论文
共 44 条
[1]
[Anonymous], 2006, ADV NEURAL INFORM PR
[2]
[Anonymous], 2005, Advances in neural information processing systems
[3]
[Anonymous], P BRIT MACH VIS C LO
[4]
[Anonymous], CSD98980 UC BERK
[5]
[Anonymous], 2007, P 24 INT C MACH LEAR
[6]
[Anonymous], P IJCAI WORKSH MACH
[7]
[Anonymous], IEEE I CONF COMP VIS
[8]
[Anonymous], P IEEE INT C DAT MIN
[9]
[Anonymous], ACM T MULTIMEDIA COM
[10]
A model of inductive bias learning [J].
Baxter, J .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 12 :149-198