Semantic video object extraction using four-band watershed and partition lattice operators

被引:19
作者
Gatica-Perez, D [1 ]
Gu, C
Sun, MT
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
[2] Univ Washington, Human Interface Technol Lab, Seattle, WA 98195 USA
[3] Microsoft Corp, Redmond, WA 98052 USA
关键词
mathematical morphology; multivalued watershed; partition lattice operators; semantic video object;
D O I
10.1109/76.920190
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We conceive the problem of multiple semantic video object (SVO) extraction as an issue of designing extensive operators on a complete lattice of partitions. As a result, we propose a framework based on spatial partition generation and application of optimal operators on the generated partitions. Based on a statistical analysis of the watershed algorithm, we develop a multivalued morphological spatial segmentation method that incorporates an edge-driven marker extraction algorithm and a growing method which integrates both color and edge information. Having embedded the problem in the partition lattice framework, we propose a spatio-temporal regional maximum likelihood operator for extraction purposes. Some theoretical properties of the operator are established. Experimental results on several MPEG-4 test video sequences show that our scheme improves the precision of the exacted SVO boundaries compared to traditional watershed algorithms and provides accurate tracking of multiple SVOs in both static and moving camera scenarios. Furthermore, this scheme can be extended to deal with more general interactive video authoring systems.
引用
收藏
页码:603 / 618
页数:16
相关论文
共 36 条
[1]  
Beucher S., 2018, Mathematical morphology in image processing, P433, DOI DOI 10.1201/9781482277234-12
[2]   EigenTracking: Robust matching and tracking of articulated objects using a view-based representation [J].
Black, MJ ;
Jepson, AD .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 26 (01) :63-84
[3]  
BLAKE A, 1988, ACTIVE CONTOURS
[4]  
Bremond F., 1996, P WORKSH CONC DESCR
[6]   A fully automated content-based video search engine supporting spatiotemporal queries [J].
Chang, SF ;
Chen, W ;
Meng, HJ ;
Sundaram, H ;
Zhong, D .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) :602-615
[7]   The role of analysis in content-based video coding and indexing [J].
Correia, P ;
Pereira, F .
SIGNAL PROCESSING, 1998, 66 (02) :125-142
[8]   Extensive operators in partition lattices for image sequence analysis [J].
Garrido, L ;
Salembier, P ;
Garcia, D .
SIGNAL PROCESSING, 1998, 66 (02) :157-180
[9]  
Heijmans HJAM., 1994, MORPHOLOGICAL IMAGE
[10]  
Li SZ., 1995, Markov random field modeling in computer vision