Video shot-boundary detection using singular-value decomposition and statistical tests

Cited by: 12
Authors
Cernekova, Zuzana [1 ]
Kotropoulos, Constantine [1 ]
Pitas, Ioannis [1 ]
Affiliation
[1] Aristotle Univ Thessaloniki, Dept Informat, Artificial Intelligence & Informat Anal Lab, Thessaloniki 54124, Greece
DOI
10.1117/1.2812528
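Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification code
0808 [Electrical Engineering]; 0809 [Electronic Science and Technology]
Abstract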
We address shot-cut detection in digital video using the singular-value decomposition (SVD). SVD is performed on a matrix whose columns are the 3D color histograms of the frames. We use SVD for its ability to derive a refined low-dimensional feature space from the high-dimensional raw feature space, in which similar video patterns are placed together and can be easily clustered. After SVD is performed, a two-phase process is employed to detect the shots. In the first phase, a dynamic clustering method creates the frame clusters. In the second phase, every two consecutive clusters produced by the clustering procedure are tested for possible merging, using statistical hypothesis testing, in order to reduce false shot-cut detections. The detection technique was applied to several TRECVID video test sets that exhibit different types of shots and contain significant object and camera motion inside the shots. We demonstrate that the method detects cuts and gradual transitions, such as dissolves and fades, with high accuracy. © 2007 SPIE and IS&T.
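The pipeline outlined in the abstract (histogram matrix, SVD projection, clustering, merging) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the clustering rule or the hypothesis test, so the cosine-similarity threshold `min_cos`, the z-like merge statistic, the threshold `z_max`, the rank, the bin count, and all function names are assumptions chosen for illustration only.

```python
import numpy as np

def color_histogram(frame, bins=8):
    """Flattened, normalized 3D RGB histogram of an (H, W, 3) uint8 frame."""
    pixels = frame.reshape(-1, 3).astype(float)
    hist, _ = np.histogramdd(pixels, bins=(bins,) * 3, range=((0, 256),) * 3)
    return hist.ravel() / hist.sum()

def svd_features(frames, rank=10, bins=8):
    """Stack histograms as columns, then project frames onto the top singular directions."""
    A = np.stack([color_histogram(f, bins) for f in frames], axis=1)
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    k = min(rank, len(s))
    return (s[:k, None] * Vt[:k]).T  # one k-dimensional row per frame

def cluster_frames(feats, min_cos=0.9):
    """Assumed clustering rule: open a new cluster when a frame's cosine
    similarity to the running cluster mean falls below min_cos."""
    boundaries, mean, count = [0], feats[0].copy(), 1
    for i in range(1, len(feats)):
        denom = np.linalg.norm(feats[i]) * np.linalg.norm(mean) + 1e-12
        if feats[i] @ mean / denom < min_cos:
            boundaries.append(i)
            mean, count = feats[i].copy(), 1
        else:
            count += 1
            mean += (feats[i] - mean) / count
    return boundaries

def merge_clusters(feats, boundaries, z_max=3.0):
    """Assumed stand-in for the hypothesis test: keep a boundary only if the gap
    between cluster means is large relative to the within-cluster spread."""
    edges = list(boundaries) + [len(feats)]
    kept = [0]
    for b in range(1, len(edges) - 1):
        left = feats[kept[-1]:edges[b]]
        right = feats[edges[b]:edges[b + 1]]
        gap = np.linalg.norm(left.mean(axis=0) - right.mean(axis=0))
        spread = np.sqrt(left.var(axis=0).sum() / len(left)
                         + right.var(axis=0).sum() / len(right))
        if gap / (spread + 1e-9) > z_max:
            kept.append(edges[b])
    return kept

# Synthetic example: five red frames followed by five blue frames.
red = np.zeros((4, 4, 3), dtype=np.uint8)
red[..., 0] = 200
blue = np.zeros((4, 4, 3), dtype=np.uint8)
blue[..., 2] = 200
feats = svd_features([red] * 5 + [blue] * 5)
shot_starts = merge_clusters(feats, cluster_frames(feats))
print(shot_starts)
```

Each returned index marks the first frame of a detected shot. On real footage the thresholds would need tuning, and gradual transitions such as dissolves would likely call for the paper's actual statistical test instead of this simplified statistic.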
Pages: 13