Reliable dissolve detection

被引:66
作者
Lienhart, R [1 ]
机构
[1] Intel Corp, Microprocessor Res Labs, Santa Clara, CA 95052 USA
来源
STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2001 | 2001年 / 4315卷
关键词
D O I
10.1117/12.410931
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
Automatic shot boundary detection has been an active research area for nearly a decade and has led to high performance detection algorithms for hard cuts, fades and wipes. Reliable dissolve detection, however, is still an unsolved problem. In this paper we present the first robust and reliable dissolve detection system. A detection rate of 69% was achieved while reducing the false alarm rate to an acceptable level of 68% on a test video set for which so far the best reported detection and false alarm rare had been 57% and 185%, respectively! In addition, the temporal extent of the dissolves are estimated by a multi-resolution detection approach. The three core ideas of our novel approach are firstly the creation of a dissolve synthesizer capable of creating in principle an infinite number of dissolve examples of any duration from a video database of raw video footage, secondly two new features for capturing the characteristics of dissolves, and thirdly, the exploitation of machine learning ideas for reliable object detection such as the bootstrap-method to improve the set of non-dissolve examples and the search at multiple resolutions as well as the usage of machine learning algorithms such as neural networks, support-vector machines and linear vector quantizer.
引用
收藏
页码:219 / 230
页数:4
相关论文
共 18 条
[1]
Bordwell David, 2013, Film Art: An Introduction
[2]
Comparison of video shot boundary detection techniques [J].
Boreczky, JS ;
Rowe, LA .
STORAGE AND RETRIEVAL FOR STILL IMAGE AND VIDEO DATABASES IV, 1996, 2670 :170-179
[3]
DAILIANAS A, 1995, P SOC PHOTO-OPT INS, V2615, P2
[4]
Performance characterization and comparison of video indexing [J].
Gargi, U ;
Kasturi, R ;
Antani, S .
1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :559-565
[5]
GARGI U, 2000, IEEE T CIRCUITS SYST, V10
[6]
LIENHART R, 1999, P SPIE, V3656
[7]
Masters T., 1994, SIGNAL IMAGE PROCESS
[8]
Mitchell T., 1997, Machine Learning, V7, P2
[9]
Neural network-based face detection [J].
Rowley, HA ;
Baluja, S ;
Kanade, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (01) :23-38
[10]
Probabilistic modeling of local appearance and spatial relationships for object recognition [J].
Schneiderman, H ;
Kanade, T .
1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :45-51