Constructing table-of-content for videos

Cited by: 169
Authors
Rui, Y [1]
Huang, TS [1]
Mehrotra, S [1]
Affiliations
[1] Univ Illinois, Beckman Inst Adv Sci & Technol, Urbana, IL 61801 USA
Keywords
video accessing; scene-level ToC construction
DOI
10.1007/s005300050138
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
A fundamental task in video analysis is to extract structure from the video to facilitate user access (browsing and retrieval). Motivated by the important role that the table of contents (ToC) plays in a book, we introduce the concept of a ToC in the video domain. Some existing approaches implicitly use a ToC, but are mainly limited to low-level entities (e.g., shots and key frames). The drawbacks are that low-level structures (1) contain too many entries to be presented to the user efficiently, and (2) do not capture the underlying semantic structure of the video, on which the user may wish to base browsing and retrieval. To address these limitations, we present a semantic-level ToC construction technique based on unsupervised clustering, which better models the time locality and scene structure of the video. Experiments on real-world movie videos validate the effectiveness of the proposed approach, and examples demonstrate how the scene-based ToC facilitates user access to the video.
Pages: 359-368
Page count: 10
Related papers
27 in total
[1] AIGRAIN P, 1995, IJCAI WORKSH INT MUL, P5
[2] AOKI H, 1995, P ACM C MULT
[3] Arman F., 1993, P SPIE STOR RETR IM
[4] BOLLE RM, 1996, VIDEO QUERY KEYWORDS
[5] BORECZKY J, 1996, P SPIE STOR RETR IM
[6] Ford R. M., 1997, P IEEE C MULT COMP S
[7] GONG Y, 1995, P IEEE C MULT COMP S
[8] GRESLE PO, 1997, 2 INT C VIS INF SYST
[9] Hampapur A., 1994, P ACM C MULT
[10] KASTURI R, 1991, COMPUTER VISION