Multiscale content extraction and representation for video indexing

Cited by: 26
Authors
Ferman, AM
Tekalp, AM
Affiliations
Source
MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II | 1997 / Vol. 3229
Keywords
video content extraction; video indexing; temporal segmentation; clustering
DOI
10.1117/12.290352
Chinese Library Classification number
O43 [Optics]
Discipline classification code
070207; 0803
Abstract
This paper presents a general multiscale framework for extraction and representation of video content. The approach exploits the inherent multiscale nature of many TV and film productions to delineate an input stream effectively and to construct consistent scenes reliably. The method first utilizes basic signal processing techniques (i.e., temporal sampling, local windowing, mean and median filtering) and unsupervised clustering to determine shot boundaries in the video sequence. Similarity comparison using shot representative histograms and clustering is then carried out within each shot to automatically select representative key frames. Finally, a model that takes into account the filmic structure of the input stream is discussed and developed to efficiently merge individual shots into coherent, meaningful segments, i.e., scenes.
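As a rough illustration of the first stage described in the abstract (local windowing, median filtering, and unsupervised clustering to find shot boundaries), the sketch below may help. It is not the authors' implementation. The abstract does not specify the frame feature, the window size, or the clustering algorithm, so the gray-level histogram difference, the `radius` parameter, and the 1-D two-means split used here are all assumptions chosen for illustration, and every function name is hypothetical.

```python
from statistics import median


def histogram(frame, bins=8, max_value=256):
    """Normalized gray-level histogram of a frame given as a flat list of ints."""
    counts = [0] * bins
    for pixel in frame:
        counts[min(pixel * bins // max_value, bins - 1)] += 1
    return [c / len(frame) for c in counts]


def frame_differences(frames, bins=8):
    """L1 distance between histograms of consecutive frames (assumed feature)."""
    hists = [histogram(f, bins) for f in frames]
    return [sum(abs(a - b) for a, b in zip(h0, h1))
            for h0, h1 in zip(hists, hists[1:])]


def subtract_local_median(diffs, radius=2):
    """Local windowing: remove the median of each window so isolated peaks stand out."""
    filtered = []
    for i, d in enumerate(diffs):
        window = diffs[max(0, i - radius): i + radius + 1]
        filtered.append(max(0.0, d - median(window)))
    return filtered


def two_means_split(values, iterations=20):
    """1-D two-means clustering (a stand-in for the unspecified clustering step).

    Returns one boolean per value: True if it falls in the high-valued cluster.
    """
    low_center, high_center = min(values), max(values)
    if low_center == high_center:
        return [False] * len(values)  # no variation, so no boundary cluster
    labels = []
    for _ in range(iterations):
        labels = [abs(v - high_center) < abs(v - low_center) for v in values]
        high = [v for v, is_high in zip(values, labels) if is_high]
        low = [v for v, is_high in zip(values, labels) if not is_high]
        low_center = sum(low) / len(low)
        high_center = sum(high) / len(high)
    return labels


def detect_shot_boundaries(frames, bins=8, radius=2):
    """Return indices of frames that start a new shot."""
    diffs = frame_differences(frames, bins)
    if not diffs:
        return []
    labels = two_means_split(subtract_local_median(diffs, radius))
    return [i + 1 for i, is_cut in enumerate(labels) if is_cut]


# Synthetic example: three "shots" of uniform dark, mid-gray, and bright frames.
video = [[10] * 16] * 4 + [[100] * 16] * 4 + [[220] * 16] * 4
boundaries = detect_shot_boundaries(video)
```

Clustering the filtered differences avoids hand-picking a global threshold, which is one plausible reading of why the abstract pairs filtering with unsupervised clustering. A real system would work on decoded video frames and would also need to handle gradual transitions, which this sketch does not attempt.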
Pages: 23-31
Page count: 9