Human activity recognition using multidimensional indexing

Cited by: 148
Authors
Ben-Arie, J [1]
Wang, ZQ [1]
Pandit, P [1]
Rajaram, S [1]
Affiliations
[1] Univ Illinois, ECE Dept, Chicago, IL 60607 USA
Funding
U.S. National Science Foundation;
Keywords
human activity recognition; multidimensional indexing; sequence recognition; human body part tracking; EXpansion Matching (EXM);
DOI
10.1109/TPAMI.2002.1023805
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we develop a novel method for view-based recognition of human action/activity from videos. By observing just a few frames, we can identify the activity that takes place in a video sequence. The basic idea of our method is that activities can be positively identified from a sparsely sampled sequence of a few body poses acquired from videos. In our approach, an activity is represented by a set of pose and velocity vectors for the major body parts (hands, legs, and torso) and stored in a set of multidimensional hash tables. We develop a theoretical foundation that shows that robust recognition of a sequence of body pose vectors can be achieved by a method of indexing and sequencing and it requires only a few pose vectors (i.e., sampled body poses in video frames). We find that the probability of false alarm drops exponentially with the increased number of sampled body poses. So, matching only a few body poses guarantees high probability for correct recognition. Our approach is parallel, i.e., all possible model activities are examined at one indexing operation since all of the model activities are stored in the same set of hash tables. In addition, our method is robust to partial occlusion since each body part is indexed separately. We use a sequence-based voting approach to recognize the activity invariant to the activity speed. Experiments performed with videos having eight different activities show robust recognition with our method. The method is also robust in conditions of varying view angle in the range of 30 degrees.
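The indexing-and-sequencing idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the bin width, the two-element pose vectors, the class name `ActivityIndex`, and the chain-scoring rule are all assumptions made for illustration. Each body part gets its own hash table, every model activity is stored in the same tables, and a query of a few sampled frames votes for activities whose matched model frames occur in increasing temporal order, so the score depends on pose order rather than activity speed.

```python
from collections import defaultdict

BIN_WIDTH = 15.0  # hypothetical quantization step; not a value from the paper


def quantize(vector, bin_width=BIN_WIDTH):
    """Map a continuous pose/velocity vector to a discrete hash key."""
    return tuple(int(v // bin_width) for v in vector)


class ActivityIndex:
    """Hypothetical index: one hash table per body part, shared by all activities."""

    def __init__(self, body_parts):
        self.tables = {part: defaultdict(set) for part in body_parts}

    def add_model(self, activity, frames):
        """Store a model activity; frames is a time-ordered list of {part: vector}."""
        for t, frame in enumerate(frames):
            for part, vector in frame.items():
                self.tables[part][quantize(vector)].add((activity, t))

    def recognize(self, frames):
        """Score each activity by its best temporally ordered chain of votes."""
        best = defaultdict(dict)  # activity -> {model time: best chain score}
        for frame in frames:
            # One indexing pass per body part; occluded parts are simply absent.
            votes = defaultdict(lambda: defaultdict(int))
            for part, vector in frame.items():
                for activity, t in self.tables[part].get(quantize(vector), ()):
                    votes[activity][t] += 1
            # Extend chains only with strictly later model frames (order, not speed).
            for activity, by_time in votes.items():
                chains = best[activity]
                updated = dict(chains)
                for t, v in by_time.items():
                    prev = max((s for u, s in chains.items() if u < t), default=0)
                    updated[t] = max(updated.get(t, 0), prev + v)
                best[activity] = updated
        return {a: max(c.values()) for a, c in best.items() if c}


# Toy data: "sit" contains the same poses as "walk" but in reverse order.
walk = [
    {"arm": (10.0, 0.0), "leg": (20.0, 5.0)},
    {"arm": (40.0, 0.0), "leg": (50.0, 5.0)},
    {"arm": (70.0, 0.0), "leg": (80.0, 5.0)},
]
index = ActivityIndex(["arm", "leg"])
index.add_model("walk", walk)
index.add_model("sit", list(reversed(walk)))

# Sparse query: only the first and last walking poses are observed.
scores = index.recognize([walk[0], walk[2]])
```

In this toy setup both activities match every queried pose, but only "walk" matches them in the right order, so its chain score is higher. Because each body part is looked up separately, dropping one part from a query frame lowers the score without preventing a match, which mirrors the occlusion robustness claimed in the abstract.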
Pages: 1091-1104
Page count: 14