A robust shot transition detection method based on support vector machine in compressed domain

被引:32
作者
Cao, Jianrong [1 ]
Cai, Anni
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] ShanDong Jianzhu Univ, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
compressed domain; shot transition detection; SVM; video; VIDEO; SEGMENTATION; MODEL;
D O I
10.1016/j.patrec.2007.03.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a new algorithm for shot transition detection. A multi-class support vector machine (SVM) classifier is constructed to differentiate frames of a video into three categories: abrupt change, gradual change and non-change. This approach enables us to integrate many kinds of features into a uniform structure and to eliminate arbitrary selection of thresholds. To enhance the robustness of the algorithm, we form the feature vector from all frames within a temporal windows, each frame represented by six features in compressed domain. Experimental results on TREC-2001 video data set have shown that the result of our algorithm is 8% higher than the best result of 2001 TREC evaluation in F1 comparison when cut and gradual changes are both considered. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1534 / 1540
页数:7
相关论文
共 20 条
[1]   A unified model for techniques on video-shot transition detection [J].
Bescós, J ;
Cisneros, G ;
Martínez, JM ;
Menéndez, JM ;
Cabrera, J .
IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (02) :293-307
[2]   Foveated shot detection for video segmentation [J].
Boccignone, G ;
Chianese, A ;
Moscato, V ;
Picariello, A .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (03) :365-377
[3]  
Burges C.J.C., 1998, TUTORIAL SUPPORT VEC
[4]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[5]  
Feng J, 1996, INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, P821, DOI 10.1109/ICIP.1996.561031
[6]  
Friedman J.H., 1996, Another approach to polychotomous classification
[7]   Unsupervised video-shot segmentation and model-free, anchorperson detection for news video story parsing [J].
Gao, XB ;
Tang, X .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (09) :765-776
[8]  
Günsel B, 1998, J ELECTRON IMAGING, V7, P592, DOI 10.1117/1.482613
[9]  
Knerr S., 1990, Neurocomputing, Algorithms, Architectures and Applications. Proceedings of the NATO Advanced Research Workshop, P41
[10]   Statistical sequential analysis for real-time video scene change detection on compressed multimedia bitstream [J].
Lelescu, D ;
Schonfeld, D .
IEEE TRANSACTIONS ON MULTIMEDIA, 2003, 5 (01) :106-117