Real-time video object segmentation for MPEG encoded video sequences

被引:22
作者
Porikli, F [1 ]
机构
[1] Mitsubishi Elect Res Labs, Cambridge, England
来源
REAL-TIME IMAGING VIII | 2004年 / 5297卷
关键词
MPEG; compressed domain segmentation; volume growing;
D O I
10.1117/12.527188
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
We propose a real-time object segmentation method for MPEG encoded video. Computational superiority is the main advantage of compressed domain processing. We exploit the macro-block structure of the encoded video to decrease the spatial resolution of the processed data, which exponentially reduces the computational load. Further reduction is achieved by temporal grouping of the intra-coded and estimated frames into a single feature layer. In addition to computational advantage, compressed-domain video possesses important features attractive for object analysis. Texture characteristics are provided by the DCT coefficients. Motion information is readily available without incurring cost of estimating a motion field. To achieve segmentation, the DCT coefficients for I-frames and block motion vectors for P-frames are combined and a frequency-temporal data structure is constructed. Starting from the blocks where the ac-coefficient energy and local inter-block dc-coefficient variance is small, the homogeneous volumes are enlarged by evaluating the distance of candidate vectors to the volume characteristics. Affine motion models are fit to volumes. Finally, a hierarchical clustering stage iteratively merges the most similar parts to generate an object partition tree as an output.
引用
收藏
页码:195 / 203
页数:9
相关论文
共 9 条
[1]  
Babu R., 2002, IEEE INT C AC SPEECH
[2]  
CAVALLI F, 2002, INT S VID IM PROC MU
[3]  
DEQUEIROZ R, 2000, IEEE T IMAGE PROCESS
[4]  
JI S, 1998, IEEE T IMAGE PROCESS
[5]   Computation-constrained fast MPEG-2 encoding [J].
Kossentini, F ;
Lee, YW .
IEEE SIGNAL PROCESSING LETTERS, 1997, 4 (08) :224-226
[6]  
PORIKLI F, 2004, J APPL SIGNAL PR JAN
[7]  
SUKMARG O, 2000, P IEEE REG 10 TECHN
[8]  
WANG H, 1996, ELECT IMAGING MULTIM
[9]  
WANG R, 2000, IEEE INT S CIRC SYST