Region-based representations of image and video:: Segmentation tools for multimedia services

被引:144
作者
Salembier, P [1 ]
Marqués, F [1 ]
机构
[1] Univ Politecn Catalunya, Dept Signal Theory & Commun, Barcelona, Spain
关键词
compression; indexing; motion estimation; MPEG-4; MPEG-7; object tracking; partition tree; regions; shot detection; spatial and temporal segmentation; video objects;
D O I
10.1109/76.809153
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper discusses region-based representations of image and video that are useful for multimedia services such as those supported by the MPEG-4 and MPEG-7 standards. Classical tools related to the generation of the region-based representations are discussed. After a description of the main processing steps and the corresponding choices in terms of feature spaces, decision spaces, and decision algorithms, the state of the art in segmentation is reviewed. Mainly tools useful in the context of the MPEG-4 and MPEG-7 standard are discussed. The review is structured around the strategies used by the algorithms (transition based or homogeneity based) and the decision spaces (spatial, spatio-temporal, and temporal). The second part of this paper proposes a partition tree representation of images and introduces a processing strategy that involves a similarity estimation step followed by a partition creation step. This strategy tries to find a compromise between what can be done in a systematic and universal way and what has to be application dependent, It is shown in particular how a single partition tree created with an extremely simple similarity feature can support a large number of segmentation applications: spatial segmentation, motion estimation, region-based coding, semantic object extraction, and region-based retrieval.
引用
收藏
页码:1147 / 1169
页数:23
相关论文
共 69 条
[1]   A survey of technologies for parsing and indexing digital video [J].
Ahanger, G ;
Little, TDC .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1996, 7 (01) :28-43
[2]   Image sequence analysis for emerging interactive multimedia services - The European COST 211 framework [J].
Alatan, AA ;
Onural, L ;
Wollborn, M ;
Mech, R ;
Tuncel, E ;
Sikora, T .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (07) :802-813
[3]  
[Anonymous], VISUAL DATABASE SYST
[4]  
[Anonymous], 1993, MARKOV RANDOM FIELDS
[5]  
Arman F., 1993, Proceedings ACM Multimedia 93, P267, DOI 10.1145/166266.166297
[6]  
BESAG JE, 1972, J ROY STAT SOC B, V34, P75
[7]  
BONNAUD L, 1997, P INT C IM PROC OCT, V2, P426
[9]   Video segmentation based on multiple features for interactive multimedia applications [J].
Castagno, R ;
Ebrahimi, T ;
Kunt, M .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) :562-571
[10]  
CHALOM E, 1996, P INT C IM PROC ICIP, V2, P525