Spatiotemporal segmentation for compact video representation

Cited by: 22
Authors
Fan, JP
Yu, J
Fujita, G
Onoye, T
Wu, L
Shirakawa, I
Affiliations
[1] Osaka Univ, Dept Informat Syst Engn, Suita, Osaka 565, Japan
[2] Fudan Univ, Dept Comp Sci, Shanghai 200433, Peoples R China
[3] So Methodist Univ, Dept Elect Engn, Dallas, TX 75275 USA
Funding
Japan Society for the Promotion of Science;
Keywords
similarity measure; entropic thresholding; spatiotemporal video segmentation; temporal tracking;
DOI
10.1016/S0923-5965(00)00036-9
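Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract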
In this paper, a novel hierarchical object-oriented video segmentation and representation algorithm is proposed. The local variance contrast and the frame difference contrast are jointly exploited for structural spatiotemporal video segmentation, because these two visual features efficiently indicate the spatial homogeneity of the grey levels and the temporal coherence of the motion fields. A two-dimensional (2D) spatiotemporal entropic technique is used to generate the 2D thresholding vectors adaptively according to the variations of the video components. After the region growing and edge simplification procedures, the accurate boundaries among the different video components are extracted by an intra-block edge extraction procedure. Moreover, the relationships of the video components among frames are exploited by a temporal tracking procedure. The proposed object-oriented spatiotemporal video segmentation algorithm may be useful for MPEG-4 systems that generate the video object plane (VOP) automatically. (C) 2001 Elsevier Science B.V. All rights reserved.
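The abstract names entropic thresholding of a frame-difference feature without giving formulas, so the following is only a rough sketch of the general idea, not the authors' algorithm. It assumes a simplified one-dimensional, Kapur-style entropy criterion in place of the paper's 2D spatiotemporal version, and the function names (`frame_difference`, `histogram`, `entropic_threshold`) are hypothetical, chosen for illustration.

```python
import math


def frame_difference(prev, curr):
    """Absolute per-pixel difference of two equally sized grey-level frames."""
    return [[abs(c - p) for p, c in zip(prow, crow)]
            for prow, crow in zip(prev, curr)]


def histogram(image, levels=256):
    """Count how often each integer level in [0, levels) occurs in a 2D list."""
    hist = [0] * levels
    for row in image:
        for value in row:
            hist[value] += 1
    return hist


def entropic_threshold(hist):
    """Return the level t that maximizes the sum of the entropies of the
    two classes {levels <= t} and {levels > t} (1D simplification, assumed)."""
    total = sum(hist)
    best_t, best_entropy = 0, float("-inf")
    low = 0  # number of samples at levels <= t
    for t in range(len(hist) - 1):
        low += hist[t]
        high = total - low
        if low == 0 or high == 0:
            continue  # one class is empty, so the split is not meaningful
        h_low = -sum((h / low) * math.log(h / low)
                     for h in hist[:t + 1] if h > 0)
        h_high = -sum((h / high) * math.log(h / high)
                      for h in hist[t + 1:] if h > 0)
        if h_low + h_high > best_entropy:
            best_t, best_entropy = t, h_low + h_high
    return best_t
```

Under these assumptions, a changed-pixel mask could be obtained by thresholding `frame_difference(prev, curr)` at `entropic_threshold(histogram(diff))`. The paper's method additionally uses the local variance contrast and selects a 2D threshold vector, which this sketch does not attempt.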
Pages: 553-566
Number of pages: 14