Activity based surveillance video content modelling

被引:32
作者
Mang, Tao [1 ]
Gong, Shaogang [1 ]
机构
[1] Queen Mary Univ London, Dept Comp Sci, London E1 4NS, England
基金
英国工程与自然科学研究理事会;
关键词
video content analysis; activity recognition; surveillance video segmentation; dynamic Bayesian networks; dynamic scene modelling; unusual activity detection;
D O I
10.1016/j.patcog.2007.11.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper tackles the problem of surveillance video content modelling. Given a set of surveillance videos, the aims of our work are twofold: firstly a continuous video is segmented according to the activities captured in the video; secondly a model is constructed for the video content, based on which an unseen activity pattern can be recognised and any unusual activities can be detected. To segment a video based on activity, we propose a semantically meaningful video content representation method and two segmentation algorithms, one being offline offering high accuracy in segmentation, and the other being online enabling real-time performance. Our video content representation method is based on automatically detected visual events (i.e. 'what is happening in the scene'). This is in contrast to most previous approaches which represent video content at the signal level using image features such as colour, motion and texture. Our segmentation algorithms are based on detecting breakpoints on a high-dimensional video content trajectory. This differs from most previous approaches which are based on shot change detection and shot grouping. Having segmented continuous surveillance videos based on activity, the activity patterns contained in the video segments are grouped into activity classes and a composite video content model is constructed which is capable of generalising from a small training set to accommodate variations in unseen activity patterns. A run-time accumulative unusual activity measure is introduced to detect unusual behaviour while usual activity patterns are recognised based on an online likelihood ratio test (LRT) method. This ensures robust and reliable activity recognition and unusual activity detection at the shortest possible time once sufficient visual evidence has become available. Comparative experiments have been carried out using over 10 h of challenging outdoor surveillance video footages to evaluate the proposed segmentation algorithms and modelling approach. (c) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2309 / 2326
页数:18
相关论文
共 44 条
  • [31] Exploring video structure beyond the shots
    Rui, Y
    Huang, TS
    Mehrotra, S
    [J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS, 1998, : 237 - 240
  • [32] ESTIMATING DIMENSION OF A MODEL
    SCHWARZ, G
    [J]. ANNALS OF STATISTICS, 1978, 6 (02) : 461 - 464
  • [33] Approximate queries and representations for large data sequences
    Shatkay, H
    Zdonik, SB
    [J]. PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, : 536 - 545
  • [34] SHERRAH J, 2001, P EUR WORKSH ADV VID
  • [35] Normalized cuts and image segmentation
    Shi, JB
    Malik, J
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) : 888 - 905
  • [36] Learning patterns of activity using real-time tracking
    Stauffer, C
    Grimson, WEL
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) : 747 - 757
  • [37] Vasconcelos N., 1998, ICIP, P550
  • [38] WEISS Y, 1999, IEEE INT C COMP VIS, P975
  • [39] AUTOMATIC RECOGNITION OF KEYWORDS IN UNCONSTRAINED SPEECH USING HIDDEN MARKOV-MODELS
    WILPON, JG
    RABINER, LR
    LEE, CH
    GOLDMAN, ER
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (11): : 1870 - 1878
  • [40] Xiang T, 2005, IEEE I CONF COMP VIS, P1238