Video object segmentation: A compressed domain approach

被引:80
作者
Babu, RV [1 ]
Ramakrishnan, KR
Srinivasan, SH
机构
[1] NTNU, Ctr Quantifiable Qual Serv Commun Syst, Trondheim, Norway
[2] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India
[3] Satyam Comp Serv Ltd, Bangalore 560025, Karnataka, India
关键词
compressed domain; expectation maximization (EM) algorithm; motion segmentation; MPEG-4; object segmentation; tracking; video object planes;
D O I
10.1109/TCSVT.2004.825536
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of extracting video objects from MPEG compressed video. The,only cues used for object segmentation are the motion vectors which are sparse in MPEG. A method for automatically estimating the number of objects and extracting independently moving video objects using Motion vectors is presented here. First, the motion vectors are accumulated over a few frames to enhance the motion information, which are further spatially interpolated to get dense motion vectors. The final segmentation, using the dense motion vectors, is obtained by applying the expectation maximization (EM) algorithm. A block-based affine clustering method is proposed for determining the number of appropriate motion models to be used for the EM step and the segmented objects are temporally tracked to obtain the video objects. Finally, a strategy for edge refinement is proposed to, extract the,precise object boundaries. Illustrative examples are provided to demonstrate the efficacy of the approach. A prominent application of the proposed method is that of object-based coding, which is part of the MPEG-4 standard.
引用
收藏
页码:462 / 474
页数:13
相关论文
共 26 条
[1]  
AYER S, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P777, DOI 10.1109/ICCV.1995.466859
[2]   The robust estimation of multiple motions: Parametric and piecewise-smooth flow fields [J].
Black, MJ ;
Anandan, P .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1996, 63 (01) :75-104
[3]   Motion segmentation by multistage affine classification [J].
Borshukov, GD ;
Bozdagi, G ;
Altunbasak, Y ;
Tekalp, AM .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1997, 6 (11) :1591-1594
[4]  
Brady N, 1996, INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL I, P925, DOI 10.1109/ICIP.1996.559651
[5]   COOPERATIVE ROBUST ESTIMATION USING LAYERS OF SUPPORT [J].
DARRELL, T ;
PENTLAND, AP .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (05) :474-487
[6]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]  
Dufaux F., 1995, Proceedings. International Conference on Image Processing (Cat. No.95CB35819), P306, DOI 10.1109/ICIP.1995.529707
[8]  
ELIAS DP, 1998, THESIS U CAMBRIDGE D
[9]   A region-level graph labeling approach to motion-based segmentation [J].
Gelgon, M ;
Bouthemy, P .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :514-519
[10]   Semiautomatic segmentation and tracking of semantic video objects [J].
Gu, C ;
Lee, MC .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) :572-584