Spatiotemporal segmentation based on region merging

被引:133
作者
Moscheni, F [1 ]
Bhattacharjee, S [1 ]
Kunt, M [1 ]
机构
[1] Swiss Fed Inst Technol, Signal Proc Lab, CH-1015 Lausanne, Switzerland
关键词
automatic spatiotemporal segmentation; object segmentation; region merging; modified Kolmogorov-Smirnov test; weighted directed graph;
D O I
10.1109/34.713358
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a technique for spatiotemporal segmentation to identify the objects present in the scene represented in a video sequence. This technique processes two consecutive frames at a time. A region-merging approach is used to identify the objects in the scene. Starting from an oversegmentation of the current frame, the objects are formed by iteratively merging regions together. Regions are merged based on their mutual spatiotemporal similarity. The spatiotemporal similarity measure takes both temporal and spatial Information into account, the emphasis being on the former. We propose a Modified Kolmogorov-Smirnov test for estimating the temporal similarity. This;est efficiently uses temporal information in both the residual distribution and the motion parametric representation. The region-merging process is based on a weighted, directed graph. Two complementary graph-based clustering rules are proposed, namely, the strong rule and the weak rule. These rules take advantage of the natural structures present in the graph. Also, the rules take into account the possible errors and uncertainties reported in the graph. The weak rule is applied after the strong rule. Each rile is applied iteratively and the graph is updated after each iteration. Experimental results on different types of scenes demonstrate the ability of the proposed technique to automatically partition the scene into its constituent objects.
引用
收藏
页码:897 / 915
页数:19
相关论文
共 49 条
[1]   INHERENT AMBIGUITIES IN RECOVERING 3-D MOTION AND STRUCTURE FROM A NOISY FLOW FIELD [J].
ADIV, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (05) :477-489
[3]  
ALLMEN M, 1993, CVGIP-IMAG UNDERSTAN, V58, P338, DOI 10.1006/ciun.1993.1046
[4]  
Anandan P., 1993, MOTION ANAL IMAGE SE, P1, DOI [10.1007/978-1-4615-3236-1_1, DOI 10.1007/978-1-4615-3236-1_1]
[5]  
[Anonymous], IMAGE COMMUN
[6]  
AYER S, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P777, DOI 10.1109/ICCV.1995.466859
[7]  
AYER S, 1993, IEEE WORKSH IM MULT, P122
[8]  
AYER S, 1993, TIME VARYING IMAGE P
[9]  
AYER S, 1994, ECCV94, V2, P316
[10]   PERFORMANCE OF OPTICAL-FLOW TECHNIQUES [J].
BARRON, JL ;
FLEET, DJ ;
BEAUCHEMIN, SS .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1994, 12 (01) :43-77