A robust framework for joint background/foreground segmentation of complex video scenes filmed with freely moving camera

被引:7
作者
Slim Amri
Walid Barhoumi
Ezzeddine Zagrouba
机构
[1] Equipe de Recherche Systèmes Intelligents en Imagerie et Vision Artificielle (SIIVA) Institut Supérieur d’Informatique,
来源
Multimedia Tools and Applications | 2010年 / 46卷
关键词
Video segmentation; Motion compensation; Moving objects; Background; Shadow identification;
D O I
暂无
中图分类号
学科分类号
摘要
This paper explores a robust region-based general framework for discriminating between background and foreground objects within a complex video sequence. The proposed framework works under difficult conditions such as dynamic background and nominally moving camera. The originality of this work lies essentially in our use of the semantic information provided by the regions while simultaneously identifying novel objects (foreground) and non-novel ones (background). The information of background regions is exploited to make moving objects detection more efficient, and vice-versa. In fact, an initial panoramic background is modeled using region-based mosaicing in order to be sufficiently robust to noise from lighting effects and shadowing by foreground objects. After the elimination of the camera movement using motion compensation, the resulting panoramic image should essentially contain the background and the ghost-like traces of the moving objects. Then, while comparing the panoramic image of the background with the individual frames, a simple median-based background subtraction permits a rough identification of foreground objects. Joint background-foreground validation, based on region segmentation, is then used for a further examination of individual foreground pixels intended to eliminate false positives and to localize shadow effects. Thus, we first obtain a foreground mask from a slow-adapting algorithm, and then validate foreground pixels (moving visual objects + shadows) by a simple moving object model built by using both background and foreground regions. The tests realized on various well-known challenging real videos (across a variety of domains) show clearly the robustness of the suggested solution. This solution, which is relatively computationally inexpensive, can be used under difficult conditions such as dynamic background, nominally moving camera and shadows. In addition to the visual evaluation, spatial-based evaluation statistics, given hand-labeled ground truth, has been used as a performance measure of moving visual objects detection.
引用
收藏
页码:175 / 205
页数:30
相关论文
共 51 条
[41]  
Vincent L(undefined)undefined undefined undefined undefined-undefined
[42]  
Soille P(undefined)undefined undefined undefined undefined-undefined
[43]  
Yan WQ(undefined)undefined undefined undefined undefined-undefined
[44]  
Wang J(undefined)undefined undefined undefined undefined-undefined
[45]  
Kankanhalli MS(undefined)undefined undefined undefined undefined-undefined
[46]  
Yilmaz A(undefined)undefined undefined undefined undefined-undefined
[47]  
Shah M(undefined)undefined undefined undefined undefined-undefined
[48]  
Zagrouba E(undefined)undefined undefined undefined undefined-undefined
[49]  
Barhoumi W(undefined)undefined undefined undefined undefined-undefined
[50]  
Amri S(undefined)undefined undefined undefined undefined-undefined