Multiscale content extraction and representation for video indexing

被引：26

作者：

Ferman, AM

Tekalp, AM

机构：

来源：

MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II | 1997年 / 3229卷

关键词：

video content extraction; video indexing; temporal segmentation; clustering;

D O I：

10.1117/12.290352

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

This paper presents a general multiscale framework for extraction and representation of video content. The approach exploits the inherent multiscale nature of many TV and film productions to delineate an input stream effectively and to construct consistent scenes reliably. The method first utilizes basic signal processing techniques (i.e, temporal sampling, local windowing, mean and median filtering), and unsupervised clustering to determine shot boundaries in the video sequence. Similarity comparison using shot representative histograms and clustering is then carried out within each shot to automatically select representative key frames. Finally, a model that takes into account the filmic structure of the input stream is discussed and developed to efficiently merge individual shots into coherent, meaningful segments, i.e. scenes.

引用

页码：23 / 31

页数：9