Region-based representations of image and video:: Segmentation tools for multimedia services

被引:144
作者
Salembier, P [1 ]
Marqués, F [1 ]
机构
[1] Univ Politecn Catalunya, Dept Signal Theory & Commun, Barcelona, Spain
关键词
compression; indexing; motion estimation; MPEG-4; MPEG-7; object tracking; partition tree; regions; shot detection; spatial and temporal segmentation; video objects;
D O I
10.1109/76.809153
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper discusses region-based representations of image and video that are useful for multimedia services such as those supported by the MPEG-4 and MPEG-7 standards. Classical tools related to the generation of the region-based representations are discussed. After a description of the main processing steps and the corresponding choices in terms of feature spaces, decision spaces, and decision algorithms, the state of the art in segmentation is reviewed. Mainly tools useful in the context of the MPEG-4 and MPEG-7 standard are discussed. The review is structured around the strategies used by the algorithms (transition based or homogeneity based) and the decision spaces (spatial, spatio-temporal, and temporal). The second part of this paper proposes a partition tree representation of images and introduces a processing strategy that involves a similarity estimation step followed by a partition creation step. This strategy tries to find a compromise between what can be done in a systematic and universal way and what has to be application dependent, It is shown in particular how a single partition tree created with an extremely simple similarity feature can support a large number of segmentation applications: spatial segmentation, motion estimation, region-based coding, semantic object extraction, and region-based retrieval.
引用
收藏
页码:1147 / 1169
页数:23
相关论文
共 69 条
[41]   ROBUST MULTIRESOLUTION ESTIMATION OF PARAMETRIC MOTION MODELS [J].
ODOBEZ, JM ;
BOUTHEMY, P .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1995, 6 (04) :348-365
[42]   Direct incremental model-based image motion segmentation for video analysis [J].
Odobez, JM ;
Bouthemy, P .
SIGNAL PROCESSING, 1998, 66 (02) :143-155
[43]  
ORTEGA A, 1992, P IEEE INT S CIRC SY, V1, P223
[44]  
PARDAS M, 1998, P INT C IM PROC ICIP
[45]  
PARDAS M, 1994, P EUSIPCO 94 7 EUR S, P18
[46]  
PAVLIDIS T, 1977, STRUCTRAL PATTERN RE, P68
[47]   Image compression using binary space partitioning trees [J].
Radha, H ;
Vetterli, M ;
Leonardi, R .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1996, 5 (12) :1610-1624
[48]   Best wavelet packet bases in a rate-distortion sense [J].
Ramchandran, Kannan ;
Vetterli, Martin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1993, 2 (02) :160-175
[49]   JOINT OPTIMIZATION OF REPRESENTATION MODEL AND FRAME SEGMENTATION FOR GENERIC VIDEO COMPRESSION [J].
REUSENS, E .
SIGNAL PROCESSING, 1995, 46 (01) :105-117
[50]   Antiextensive connected operators for image and sequence processing [J].
Salembier, P ;
Oliveras, A ;
Garrido, L .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (04) :555-570