Multimodal approach for summarizing and indexing news video

被引:8
作者
Kim, JG [1 ]
Chang, HS
Kim, YT
Kang, K
Kim, M
Kim, J [1 ]
Kim, HM
机构
[1] Elect & Telecommun Res Inst, Broadcasting Media Technol Dept, Taejon, South Korea
[2] Informat & Commun Univ, Sch Engn, Taejon, South Korea
[3] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Taejon, South Korea
关键词
D O I
10.4218/etrij.02.0102.0101
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A video summary abstracts the gist from an entire video and also enables efficient access to the desired content. In this paper, we propose a novel method for summarizing news video based on multimodal analysis of the content. The proposed method exploits the closed caption data to locate semantically meaningful highlights in a news video and speech signals in an audio stream to align the closed caption data with the video in a time-line. Then, the detected highlights are described using MPEG-7 Summarization Description Scheme, which allows efficient browsing of the content through such functionalities as multi-level abstracts and navigation guidance. Multimodal search and retrieval are also within the proposed framework. By indexing synchronized closed caption data, the video clips are searchable by inputting a text query. Intensive experiments with prototypical systems are presented to demonstrate the validity and reliability of the proposed method in real applications.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 17 条
[1]  
[Anonymous], **NON-TRADITIONAL**
[2]  
CHANG HS, 2000, JTC1SC29WG11 ISOIEC
[3]   An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis [J].
Hanjalic, A ;
Zhang, HJ .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (08) :1280-1289
[4]  
HANJALIC A, 1999, P IS T SPIE STORAG 7, V3656, P86
[5]   Automated generation of news content hierarchy by integrating audio, video, and text information [J].
Huang, Q ;
Liu, Z ;
Rosenberg, A ;
Gibbon, D ;
Shahraray, B .
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, :3025-3028
[6]   Summary description schemes for efficient video navigation and browsing [J].
Kim, JG ;
Chang, HS ;
Kim, M ;
Kim, J ;
Kim, HM .
VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2000, PTS 1-3, 2000, 4067 :1397-1408
[7]   Broadcast news navigation using story segmentation [J].
Merlino, A ;
Morey, D ;
Maybury, M .
ACM MULTIMEDIA 97, PROCEEDINGS, 1997, :381-391
[8]  
*MPEG MDS GROUP, 2001, JTC1SC29WG11 ISOIEC
[9]  
*MPEG MDS GROUP, 1999, JTC1SC29WG11 ISOIEC
[10]   Abstracting digital movies automatically [J].
Pfeiffer, S ;
Lienhart, R ;
Fischer, S ;
Effelsberg, W .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1996, 7 (04) :345-353