Structure analysis of soccer video with domain knowledge and hidden Markov models

被引:122
作者
Xie, LX [1 ]
Xu, P
Chang, SF
Divakaran, A
Sun, HF
机构
[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
[2] Mitsubishi Elect Res Labs, Murray Hill, NJ USA
关键词
sports video analysis; soccer video; hidden Markov models; dynamic programming; video syntax;
D O I
10.1016/j.patrec.2004.01.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present statistical techniques for parsing the structure of produced soccer programs. The problem is important for applications such as personalized video streaming and browsing systems, in which videos are segmented into different states and important states are selected based on user preferences. While prior work focuses on the detection of special events such as goals or corner kicks, this paper is concerned with generic structural elements of the game. We define two mutually exclusive states of the game, play and break based on the rules of soccer. Automatic detection of such generic states represents an original challenging issue due to high appearance diversities and temporal dynamics of such states in different videos. We select a salient feature set from the compressed domain, dominant color ratio and motion intensity, based on the special syntax and content characteristics of soccer videos. We then model the stochastic structures of each state of the game with a set of hidden Markov models. Finally, higher-level transitions are taken into account and dynamic programming techniques are used to obtain the maximum likelihood segmentation of the video sequence. The system achieves a promising classification accuracy of 83.5%, with light-weight computation on feature extraction and model inference, as well as a satisfactory accuracy in boundary timing. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:767 / 775
页数:9
相关论文
共 13 条
[1]  
[Anonymous], ICASSP 92
[2]  
*FIFA, 2002, LAWS GAM
[3]  
GONG Y, 1995, IEEE INT C MULT COMP, P167
[4]  
QIAN R, 2001, IEEE INT C MULT EXP
[5]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[6]  
SHOOK F, 1995, TELEVISION FIELD PRO, pCH12
[7]  
SUDHIR G, 1998, IEEE INT WORKSH CONT
[8]   Multimedia content analysis - Using both audio and visual clues [J].
Wang, Y ;
Liu, Z ;
Huang, JC .
IEEE SIGNAL PROCESSING MAGAZINE, 2000, 17 (06) :12-36
[9]  
*WIK, 2002, WIK FREE ENC
[10]  
XIE L, 2002, P INT C AC SPEECH SI