Event based indexing of broadcasted sports video by intermodal collaboration

Cited by: 115
Authors
Babaguchi, N [1]
Kawai, Y
Kitahashi, T
Affiliations
[1] Osaka Univ, Inst Sci & Ind Res, Osaka 5670047, Japan
[2] NHK Japan Broadcasting Corp, Tokyo, Japan
Funding
Japan Society for the Promotion of Science
Keywords
closed caption (CC) text; contents-based video indexing; event detection; multimodal information stream
DOI
10.1109/6046.985555
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose event-based video indexing, a form of indexing by semantic content. Because video data is composed of multimodal information streams, such as visual, auditory, and textual [closed caption (CC)] streams, we introduce a strategy of intermodal collaboration, i.e., collaborative processing that takes account of the semantic dependency between these streams. Its aim is to improve the reliability and efficiency of video content analysis. Focusing here on the temporal correspondence between the visual and CC streams, the proposed method extracts keywords from the CC stream to find time spans in which events are likely to take place, and then indexes shots in the visual stream. Experimental results for broadcast sports video of American football games indicate that intermodal collaboration is effective for indexing video by events such as touchdown (TD) and field goal (FG).
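The pipeline outlined in the abstract (keyword extraction from the CC stream, estimation of candidate time spans, then indexing of overlapping shots) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Caption` and `Shot` types, the keyword lists, and the `before`/`after` window lengths are all hypothetical choices made for illustration, and the abstract does not say how the time spans are actually derived.

```python
from dataclasses import dataclass


@dataclass
class Caption:
    time: float  # time at which the caption appears, in seconds
    text: str


@dataclass
class Shot:
    start: float  # shot boundaries in seconds
    end: float


# Hypothetical keyword lists; the record above does not specify the real ones.
EVENT_KEYWORDS = {
    "TD": ("touchdown",),
    "FG": ("field goal",),
}


def find_event_spans(captions, before=30.0, after=10.0):
    """Return (event, span_start, span_end) for each caption with an event keyword.

    The window is an assumption: captions usually trail the action they
    describe, so the span reaches further back than forward.
    """
    spans = []
    for cap in captions:
        text = cap.text.lower()
        for event, keywords in EVENT_KEYWORDS.items():
            if any(k in text for k in keywords):
                spans.append((event, max(0.0, cap.time - before), cap.time + after))
    return spans


def index_shots(shots, spans):
    """Attach an event label to every shot that overlaps an event span."""
    index = []
    for event, span_start, span_end in spans:
        for shot in shots:
            if shot.start < span_end and shot.end > span_start:
                index.append((event, shot))
    return index
```

A caller would pass time-stamped captions and previously detected shot boundaries, for example `index_shots(shots, find_event_spans(captions))`. A practical system would also need visual cues to narrow each span down to the shots that really show the event, which this sketch omits.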
Pages: 68-75
Page count: 8