Temporal spatio-velocity transform and its application to tracking and interaction

被引:103
作者
Sato, K [1 ]
Aggarwal, JK [1 ]
机构
[1] Univ Texas, Dept Elect & Comp Engn, Comp & Vis Res Ctr, Austin, TX 78712 USA
关键词
temporal spatio-velocity transform; Hough transform; spatio-temporal; windowing; human segmentation; tracking; interaction recognition;
D O I
10.1016/j.cviu.2004.02.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
This paper describes the temporal spatio-velocity (TSV) transform for extracting pixel velocities from binary image sequences. The TSV transform is derived from the Hough transform over windowed spatio-temporal images. We present the methodology of the transform and its implementation in an iterative computational form. The intensity at each pixel in the TSV image represents a measure of the likelihood of occurrence of a pixel with instantaneous velocity in the current position. Binarization of the TSV image extracts blobs based on the similarity of velocity and position. The TSV transform provides an efficient way to remove noise by focusing on stable velocities, and constructs noise-free blobs. We apply the transform to tracking human figures in a sidewalk environment and extend its use to an interaction recognition system. The system performs background subtraction to separate the foreground image from the background, extracts standing human objects and generates a one-dimensional binary image sequence. The TSV transform takes the one-dimensional image sequence and yields the TSV images. Thresholding of the TSV image generates the human blobs. We obtain the human trajectories by associating the segmented blobs over time using blob features. We analyze the motion-state transitions of human interactions, which we consider to be combinations of ten simple interaction units (SIUs). Our system recognizes the 10 SIUs by analyzing the shape of the human trajectory. We illustrate the TSV transform and its application to real images for human segmentation, tracking and interaction classification. (C) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:100 / 128
页数:29
相关论文
共 19 条
[1]
Human motion analysis: A review [J].
Aggarwal, JK ;
Cai, Q .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1999, 73 (03) :428-440
[2]
Recognizing human actions in a static room [J].
Ayers, D ;
Shah, M .
FOURTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION - WACV'98, PROCEEDINGS, 1998, :42-47
[3]
Bobick AF, 2001, IEEE T PATTERN ANAL, V23, P257, DOI 10.1109/34.910878
[4]
BOBICK AF, 1996, P BRIT MACH VIS C, V1, P13
[5]
Tracking human motion in structured environments using a distributed-camera system [J].
Cai, Q ;
Aggarwal, JK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (11) :1241-1247
[6]
A Novel Technique for Image-Velocity Computation [J].
Chong, Chu Phoon ;
Salama, Andre T. ;
Smith, Kenneth C. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1992, 2 (03) :313-318
[7]
W4:: Who?: When?: Where?: What?: A real time system for detecting and tracking people [J].
Haritaoglu, I ;
Harwood, D ;
Davis, LS .
AUTOMATIC FACE AND GESTURE RECOGNITION - THIRD IEEE INTERNATIONAL CONFERENCE PROCEEDINGS, 1998, :222-227
[8]
Hydra:: Multiple people detection and tracking using silhouettes [J].
Haritaoglu, I ;
Harwood, D ;
Davis, LS .
SECOND IEEE WORKSHOP ON VISUAL SURVEILLANCE (VS'99), PROCEEDINGS, 1999, :6-13
[9]
DETERMINING OPTICAL-FLOW [J].
HORN, BKP ;
SCHUNCK, BG .
ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :185-203
[10]
FAST CONVERGENT METHOD FOR OPTICAL-FLOW ESTIMATION IN NOISY IMAGE SEQUENCES [J].
KIM, JD ;
KIM, SD ;
KIM, JK .
ELECTRONICS LETTERS, 1989, 25 (01) :74-75