Discovery and segmentation of activities in video

被引:190
作者
Brand, M
Kettnaker, V
机构
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
[2] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
关键词
video activity monitoring; hidden Markov models; hidden state; parameter estimation; entropy minimization;
D O I
10.1109/34.868685
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hidden Markov models (HMMs) have become the workhorses of the monitoring and event recognition literature because they bring to time-series analysts the utility of density estimation and the convenience of dynamic time warping. Once trained. the internals of these models are considered opaque; there is no effort to interpret the hidden states. We show that by minimizing the entropy of the joint distribution, an HMM's internal state machine can be made to organize observed activity into meaningful states. This has uses in video monitoring and annotation, low bit-rate coding of scene activity. and detection of anomalous behavior. We demonstrate with models of office activity and outdoor traffic, showing how the framework learns principal modes of activity and patterns of activity change. We then show how this framework can be adapted to infer hidden state from extremely ambiguous images. In particular, inferring 3D body orientation and pose from sequences of low-resolution silhouettes.
引用
收藏
页码:844 / 851
页数:8
相关论文
共 18 条
[1]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[2]  
BENGIO Y, 1995, ADV NEURAL INFORMATI, V7, P553
[3]   Structure learning in conditional probability models via an entropic prior and parameter extinction [J].
Brand, M .
NEURAL COMPUTATION, 1999, 11 (05) :1155-1182
[4]  
BRAND M, 1999, P INT C COMP VIS
[5]  
BRAND M, 2000, P INT C MACH LEARN
[6]  
BRAND M, 1999, ARTIFICIAL INTELLIGE
[7]   Using adaptive tracking to classify and monitor activities in a site [J].
Grimson, WEL ;
Stauffer, C ;
Romano, R ;
Lee, L .
1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :22-29
[8]  
Jelinek F., 1998, Statistical Methods for Speech Recognition
[9]  
JUANG BH, 1986, IEEE T INFORM THEORY, V32, P307, DOI 10.1109/TIT.1986.1057145