Fusing Multiple Features for Depth-Based Action Recognition

Cited by: 52
Authors
Zhu, Yu [1 ]
Chen, Wenbin [1 ]
Guo, Guodong [1 ]
Affiliations
[1] W Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26506 USA
Keywords
Algorithms; Experimentation; Performance; Human Factors; RGB-D sensor; depth maps; action recognition; spatiotemporal features; skeleton; 4D descriptor; data fusion; decision level; feature level; feature selection; CLASSIFIER FUSION;
DOI
10.1145/2629483
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline Code
140502 [Artificial Intelligence]
Abstract
Human action recognition is a very active research topic in computer vision and pattern recognition. Recently, the three-dimensional (3D) depth data captured by emerging RGB-D sensors have shown great potential for human action recognition. Several features and/or algorithms have been proposed for depth-based action recognition. A natural question arises: Can we find complementary features and combine them to improve the accuracy of depth-based action recognition significantly? To address this question and gain a better understanding of the problem, we study the fusion of different features for depth-based action recognition. Although data fusion has shown great success in other areas, it has not yet been well studied for 3D action recognition. Several issues need to be addressed, for example, whether fusion is helpful for depth-based action recognition, and how to perform the fusion properly. In this article, we study different fusion schemes comprehensively, using diverse features for action characterization in depth videos. Two levels of fusion schemes are investigated, namely the feature level and the decision level, and various methods are explored at each level. Four different features are considered to characterize depth action patterns from different aspects. The experiments are conducted on four challenging depth action databases, in order to evaluate the fusion methods and identify the best ones in general. Our experimental results show that the four features investigated in the article complement each other, and that appropriate fusion methods can improve the recognition accuracies significantly over each individual feature. More importantly, our fusion-based action recognition outperforms the state-of-the-art approaches on these challenging databases.
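The abstract contrasts fusion at two levels: combining descriptors before classification (feature level) versus combining per-classifier scores after classification (decision level). A minimal sketch of these two generic schemes, not the authors' implementation, is shown below with toy score vectors; the function names, the equal default weights, and the weighted-sum combination rule are illustrative assumptions.

```python
def feature_level_fusion(feature_vectors):
    """Feature-level fusion: concatenate the per-feature descriptors
    into a single vector, which a classifier is then trained on."""
    fused = []
    for vec in feature_vectors:
        fused.extend(vec)
    return fused


def decision_level_fusion(score_vectors, weights=None):
    """Decision-level fusion: combine the per-class score vectors from
    several classifiers with a weighted-sum rule (equal weights by default)."""
    n = len(score_vectors)
    if weights is None:
        weights = [1.0 / n] * n  # illustrative choice; the article studies several rules
    num_classes = len(score_vectors[0])
    fused = [0.0] * num_classes
    for w, scores in zip(weights, score_vectors):
        for c, s in enumerate(scores):
            fused[c] += w * s
    return fused


def predict(fused_scores):
    """Final decision: the class with the highest fused score."""
    return max(range(len(fused_scores)), key=fused_scores.__getitem__)
```

For example, two classifiers with class scores `[0.2, 0.8]` and `[0.6, 0.4]` fuse (with equal weights) to `[0.4, 0.6]`, so the final prediction is class 1; feature-level fusion would instead concatenate the raw descriptors before any classifier is applied.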
Pages: 20
Cited References
52 items in total
[1]
Human Activity Analysis: A Review [J].
Aggarwal, J. K. ;
Ryoo, M. S. .
ACM COMPUTING SURVEYS, 2011, 43 (03)
[2]
Experimental evaluation of expert fusion strategies [J].
Alkoot, FM ;
Kittler, J .
PATTERN RECOGNITION LETTERS, 1999, 20 (11-13) :1361-1369
[3]
[Anonymous], IEEE SYST J
[4]
[Anonymous], J NANOTECHNOL
[5]
Multimodal fusion for multimedia analysis: a survey [J].
Atrey, Pradeep K. ;
Hossain, M. Anwar ;
El Saddik, Abdulmotaleb ;
Kankanhalli, Mohan S. .
MULTIMEDIA SYSTEMS, 2010, 16 (06) :345-379
[6]
SURF: Speeded up robust features [J].
Bay, Herbert ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417
[7]
Fusion of face and speech data for person identity verification [J].
Ben-Yacoub, S ;
Abdeljaoued, Y ;
Mayoraz, E .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1065-1074
[8]
Bingbing Ni, 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), P1147, DOI 10.1109/ICCVW.2011.6130379
[9]
Breiman L, 2001, MACH LEARN, V45, P5
[10]
Brown G, 2012, J MACH LEARN RES, V13, P27