Automatic soccer video analysis and summarization

被引:511
作者
Ekin, A
Tekalp, AM
Mehrotra, R
机构
[1] Univ Rochester, Dept Elect & Comp Engn, Rochester, NY 14627 USA
[2] Eastman Kodak Co, Entertainment Imaging Div, Rochester, NY 14650 USA
基金
美国国家科学基金会;
关键词
cinematic features; object-based features; semantic event detection; shot classification; slow-motion replay detection; soccer video processing; soccer video summarization;
D O I
10.1109/TIP.2003.812758
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
We propose a fully automatic and computationally efficient videos using cinematic and object-based features. The proposed framework includes some novel low-level soccer video processing algorithms, such as dominant color region detection, robust shot boundary detection, and shot classification, as well as some higher-level algorithms for goal detection, referee detection, and penalty-box detection. The system can output three types of summaries: i) all slow-motion segments in a game, ii) all goals in a game, and iii) slow-motion segments classified according to object-based features. The first two types of summaries are based on cinematic features only for speedy processing, while the summaries of the last type contain higher-level semantics. The proposed framework is efficient, effective, and robust for soccer video processing. It is efficient in the sense that there is no need to compute object-based features when cinematic features are sufficient for the detection of certain events, e.g., goals in soccer. It is effective in the sense that the framework can also employ object-based features when needed to increase accuracy (at the expense of more computation). The efficiency, effectiveness, and the robustness of the proposed framework are demonstrated over a large data set, consisting of more than 13 hours of soccer video, captured at different countries and conditions.
引用
收藏
页码:796 / 807
页数:12
相关论文
共 30 条
[1]
[Anonymous], 1995, P IEEE INT C MULT CO
[2]
ASSFALG J, 2002, P IEEE INT C MULT EX
[3]
Event based indexing of broadcasted sports video by intermodal collaboration [J].
Babaguchi, N ;
Kawai, Y ;
Kitahashi, T .
IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (01) :68-75
[4]
The Holy Grail of content-based media analysis [J].
Chang, SF .
IEEE MULTIMEDIA, 2002, 9 (02) :6-+
[5]
EKIN A, 2002, P SPIE JAN
[6]
Ferman A. M., 2001, Journal of Electronic Imaging, V10, P917, DOI 10.1117/1.1406946
[7]
Efficient filtering and clustering methods for temporal video segmentation and visual summarization [J].
Ferman, AM ;
Tekalp, AM .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1998, 9 (04) :336-351
[8]
Temporal segmentation of video objects for hierarchical object-based motion description [J].
Fu, Y ;
Ekin, A ;
Tekalp, AM ;
Mehrotra, R .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (02) :135-145
[9]
Tracking pitches for broadcast television [J].
Guéziec, A .
COMPUTER, 2002, 35 (03) :38-+