Efficient summarization of stereoscopic video sequences

被引:56
作者
Doulamis, ND [1 ]
Doulamis, AD [1 ]
Avrithis, YS [1 ]
Ntalianis, KS [1 ]
Kollias, SD [1 ]
机构
[1] Natl Tech Univ Athens, Dept Elect & Comp Engn, GR-15773 Athens, Greece
关键词
content-based indexing and retrieval; stereoscopic image analysis; video summarization;
D O I
10.1109/76.844996
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An efficient technique for summarization of stereoscopic video sequences is presented in this paper, which extracts a small but meaningful set of video frames using a content-based sampling algorithm. The proposed video-content representation provides the capability of browsing digital stereoscopic video sequences and performing more efficient content-based queries and indexing. Each stereoscopic video sequence is first partitioned into shots by applying a shot-cut detection algorithm so that frames (or stereo pairs) of similar visual characteristics are gathered together. Each shot is then analyzed using stereo-imaging techniques, and the disparity field, occluded areas, and depth map are estimated. A multiresolution implementation of the Recursive Shortest Spanning Tree (RSST) algorithm is applied for color and depth segmentation, while fusion of color and depth segments is employed for reliable video object extraction. In particular, color segments are projected onto depth segments so that video objects on the same depth plane are retained, while at the same time accurate object boundaries are extracted. Feature vectors are then constructed using multidimensional fuzzy classification of segment features including size, location, color, and depth. Shot selection is accomplished by clustering similar shots based on the generalized Lloyd-Max algorithm, while for a given shot, key frames are extracted using an optimization method for locating frames of minimally correlated feature vectors. For efficient implementation of the latter method, a genetic algorithm is used. Experimental results are presented, which indicate the reliable performance of the proposed scheme on real-life stereoscopic video sequences.
引用
收藏
页码:501 / 517
页数:17
相关论文
共 48 条
[1]   Image sequence analysis for emerging interactive multimedia services - The European COST 211 framework [J].
Alatan, AA ;
Onural, L ;
Wollborn, M ;
Mech, R ;
Tuncel, E ;
Sikora, T .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (07) :802-813
[2]  
[Anonymous], 1992, NEURAL NETWORKS FUZZ
[3]  
[Anonymous], IMAGE TECHNOLOGY NOV
[4]  
ARMAN F, 1994, ACM MULTIMEDIA AUG, P77
[5]  
AVRITHIS Y, 1998, P IEEE INT C COMP VI
[6]  
AVRITHIS Y, 1999, COMPUT VIS IMAGE UND, V75
[7]   DISPARITY ANALYSIS OF IMAGES [J].
BARNARD, ST ;
THOMPSON, WB .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1980, 2 (04) :333-340
[8]   Video segmentation based on multiple features for interactive multimedia applications [J].
Castagno, R ;
Ebrahimi, T ;
Kunt, M .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) :562-571
[9]   Next-generation content representation, creation, and searching for new-media applications in education [J].
Chang, SF ;
Eleftheriadis, A ;
Mcclintock, R .
PROCEEDINGS OF THE IEEE, 1998, 86 (05) :884-904
[10]   MPEG and multimedia communications [J].
Chiariglione, L .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1997, 7 (01) :5-18