Shot-boundary detection: Unraveled and resolved?

Cited by: 274
Authors
Hanjalic, A [1]
Affiliation
[1] Delft Univ Technol, Fac Informat Technol & Syst, Dept Mediamat, NL-2628 CD Delft, Netherlands
Keywords
shot-boundary detection; video analysis; video databases; video retrieval
DOI
10.1109/76.988656
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract
Partitioning a video sequence into shots is the first step toward video-content analysis and content-based video browsing and retrieval. A video shot is defined as a series of interrelated consecutive frames taken contiguously by a single camera and representing a continuous action in time and space. As such, shots are considered to be the primitives for higher level content analysis, indexing, and classification. The objective of this paper is twofold. First, we analyze the shot-boundary detection problem in detail and identify the major issues that need to be considered in order to solve this problem successfully. Then, we present a conceptual solution to the shot-boundary detection problem in which all issues identified in the previous step are considered. This solution is provided in the form of a statistical detector that is based on minimization of the average detection-error probability. We model the required statistical functions using a robust metric for visual content discontinuities (based on motion compensation) and take into account all a priori knowledge that we found relevant to shot-boundary detection. This knowledge includes the shot-length distribution, visual discontinuity patterns at shot boundaries, and characteristic temporal changes of visual features around a boundary. The major advantages of the proposed detector are its robust and sequence-independent performance and its ability to detect different types of shot boundaries simultaneously. We demonstrate the performance of our detector on the two most widely used types of shot boundaries: hard cuts and dissolves.
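The idea of a detector that minimizes the average detection-error probability can be illustrated with a generic minimum-error (MAP) decision rule. The sketch below is not the paper's model: the Gaussian likelihoods, their parameters, the saturating prior, and the names `boundary_prior` and `is_boundary` are all assumptions chosen for illustration. It only shows how a discontinuity value and a shot-length-dependent prior could be combined in one decision.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def boundary_prior(frames_since_last_boundary, mean_shot_length=100.0):
    """Hypothetical a priori boundary probability (assumption, not from the paper).

    It grows with the number of frames elapsed since the last detected
    boundary and saturates at 0.5.
    """
    return 0.5 * (1.0 - math.exp(-frames_since_last_boundary / mean_shot_length))


def is_boundary(discontinuity, frames_since_last_boundary):
    """Minimum-error decision: pick the hypothesis with the larger posterior.

    The likelihood parameters below are illustrative placeholders.
    """
    prior = boundary_prior(frames_since_last_boundary)
    like_boundary = gaussian_pdf(discontinuity, mean=60.0, std=15.0)
    like_no_boundary = gaussian_pdf(discontinuity, mean=5.0, std=5.0)
    # Decide "boundary" when prior * p(z | boundary) exceeds
    # (1 - prior) * p(z | no boundary).
    return prior * like_boundary > (1.0 - prior) * like_no_boundary


if __name__ == "__main__":
    # The same ambiguous discontinuity value is judged differently
    # depending on how long the current shot has lasted.
    print(is_boundary(22.0, 5))
    print(is_boundary(22.0, 200))
```

Under these assumed parameters, an ambiguous discontinuity value is rejected shortly after a boundary and accepted once the current shot has run long, which is the kind of effect a shot-length prior is meant to have.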
Pages: 90-105
Page count: 16