Human activity recognition using multidimensional indexing

Cited by: 148

Authors:

Ben-Arie, J [1]

Wang, ZQ [1]

Pandit, P [1]

Rajaram, S [1]

Affiliations:

[1] Univ Illinois, ECE Dept, Chicago, IL 60607 USA

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2002 / Vol. 24 / No. 08

Funding:

National Science Foundation (USA);

Keywords:

human activity recognition; multidimensional indexing; sequence recognition; human body part tracking; EXpansion Matching (EXM);

DOI:

10.1109/TPAMI.2002.1023805

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we develop a novel method for view-based recognition of human action/activity from videos. By observing just a few frames, we can identify the activity that takes place in a video sequence. The basic idea of our method is that activities can be positively identified from a sparsely sampled sequence of a few body poses acquired from videos. In our approach, an activity is represented by a set of pose and velocity vectors for the major body parts (hands, legs, and torso) and stored in a set of multidimensional hash tables. We develop a theoretical foundation that shows that robust recognition of a sequence of body pose vectors can be achieved by a method of indexing and sequencing and it requires only a few pose vectors (i.e., sampled body poses in video frames). We find that the probability of false alarm drops exponentially with the increased number of sampled body poses. So, matching only a few body poses guarantees high probability for correct recognition. Our approach is parallel, i.e., all possible model activities are examined at one indexing operation since all of the model activities are stored in the same set of hash tables. In addition, our method is robust to partial occlusion since each body part is indexed separately. We use a sequence-based voting approach to recognize the activity invariant to the activity speed. Experiments performed with videos having eight different activities show robust recognition with our method. The method is also robust in conditions of varying view angle in the range of 30 degrees.
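The indexing-and-voting idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the body-part names, the uniform quantization with `bin_size`, and the functions `quantize`, `build_tables`, and `recognize` are all hypothetical choices made for illustration. The vote here simply counts matching body parts per sampled frame; it ignores temporal ordering and speed invariance, which the paper's sequence-based voting is said to handle.

```python
from collections import defaultdict

# Hypothetical body-part names; the abstract lists hands, legs, and torso.
BODY_PARTS = ("hand", "leg", "torso")


def quantize(vector, bin_size=10.0):
    """Map a pose/velocity vector to a discrete hash key (assumed uniform bins)."""
    return tuple(int(v // bin_size) for v in vector)


def build_tables(models, bin_size=10.0):
    """Store all model activities in one shared hash table per body part.

    models: {activity_name: [frame, ...]}, where frame is {part: vector}.
    """
    tables = {part: defaultdict(set) for part in BODY_PARTS}
    for activity, frames in models.items():
        for t, frame in enumerate(frames):
            for part, vector in frame.items():
                tables[part][quantize(vector, bin_size)].add((activity, t))
    return tables


def recognize(tables, sampled_frames, bin_size=10.0):
    """Vote for activities using only a few sampled frames.

    A body part missing from a frame (e.g. occluded) casts no vote,
    because each part is indexed separately.
    """
    votes = defaultdict(int)
    for frame in sampled_frames:
        for part, vector in frame.items():
            hits = tables[part].get(quantize(vector, bin_size), ())
            # One lookup examines every stored activity at once.
            for activity in {name for name, _ in hits}:
                votes[activity] += 1
    best = max(votes, key=votes.get) if votes else None
    return best, dict(votes)
```

Because every model activity lives in the same set of tables, a single lookup per body part checks all activities at once, and an occluded part only reduces the number of votes rather than blocking recognition.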


Pages: 1091 / 1104

Page count: 14
