Activity based surveillance video content modelling

被引:32
作者
Mang, Tao [1 ]
Gong, Shaogang [1 ]
机构
[1] Queen Mary Univ London, Dept Comp Sci, London E1 4NS, England
基金
英国工程与自然科学研究理事会;
关键词
video content analysis; activity recognition; surveillance video segmentation; dynamic Bayesian networks; dynamic scene modelling; unusual activity detection;
D O I
10.1016/j.patcog.2007.11.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper tackles the problem of surveillance video content modelling. Given a set of surveillance videos, the aims of our work are twofold: firstly a continuous video is segmented according to the activities captured in the video; secondly a model is constructed for the video content, based on which an unseen activity pattern can be recognised and any unusual activities can be detected. To segment a video based on activity, we propose a semantically meaningful video content representation method and two segmentation algorithms, one being offline offering high accuracy in segmentation, and the other being online enabling real-time performance. Our video content representation method is based on automatically detected visual events (i.e. 'what is happening in the scene'). This is in contrast to most previous approaches which represent video content at the signal level using image features such as colour, motion and texture. Our segmentation algorithms are based on detecting breakpoints on a high-dimensional video content trajectory. This differs from most previous approaches which are based on shot change detection and shot grouping. Having segmented continuous surveillance videos based on activity, the activity patterns contained in the video segments are grouped into activity classes and a composite video content model is constructed which is capable of generalising from a small training set to accommodate variations in unseen activity patterns. A run-time accumulative unusual activity measure is introduced to detect unusual behaviour while usual activity patterns are recognised based on an online likelihood ratio test (LRT) method. This ensures robust and reliable activity recognition and unusual activity detection at the shortest possible time once sufficient visual evidence has become available. Comparative experiments have been carried out using over 10 h of challenging outdoor surveillance video footages to evaluate the proposed segmentation algorithms and modelling approach. (c) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2309 / 2326
页数:18
相关论文
共 44 条
  • [1] Event based indexing of broadcasted sports video by intermodal collaboration
    Babaguchi, N
    Kawai, Y
    Kitahashi, T
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (01) : 68 - 75
  • [2] Bishop CM., 1995, Neural networks for pattern recognition
  • [3] The recognition of human movement using temporal templates
    Bobick, AF
    Davis, JW
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) : 257 - 267
  • [4] Boiman O, 2005, IEEE I CONF COMP VIS, P462
  • [5] DEMENTHON D, 2000, ADV VISUAL INFORM SY
  • [6] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [7] Enser P, 2003, LECT NOTES COMPUT SC, V2728, P291
  • [8] Bayesian network classifiers
    Friedman, N
    Geiger, D
    Goldszmidt, M
    [J]. MACHINE LEARNING, 1997, 29 (2-3) : 131 - 163
  • [9] GE X, 2000, AEC APC S, V12
  • [10] Ghahramani Z, 1998, LECT NOTES ARTIF INT, V1387, P168, DOI 10.1007/BFb0053999