Real-time compressed-domain spatiotemporal segmentation and ontologies for video indexing and retrieval

Cited by: 94
Authors
Mezaris, V [1]
Kompatsiaris, I
Boulgouris, NV
Strintzis, MG
Affiliations
[1] Aristotle Univ Thessaloniki, Dept Elect & Comp Engn, Informat Proc Lab, Thessaloniki 54124, Greece
[2] CERTH, ITI, Thessaloniki 57001, Greece
Keywords
compressed-domain segmentation; object-based video indexing; ontologies; real-time segmentation; relevance feedback; spatiotemporal video segmentation; support vector machines
DOI
10.1109/TCSVT.2004.826768
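Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract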
In this paper, a novel algorithm is presented for the real-time, compressed-domain, unsupervised segmentation of image sequences and is applied to video indexing and retrieval. The segmentation algorithm uses motion and color information directly extracted from the MPEG-2 compressed stream. An iterative rejection scheme based on the bilinear motion model is used to effect foreground/background segmentation. Following that, meaningful foreground spatiotemporal objects are formed by initially examining the temporal consistency of the output of iterative rejection, clustering the resulting foreground macroblocks to connected regions and finally performing region tracking. Background segmentation to spatiotemporal objects is additionally performed. MPEG-7 compliant low-level descriptors describing the color, shape, position, and motion of the resulting spatiotemporal objects are extracted and are automatically mapped to appropriate intermediate-level descriptors forming a simple vocabulary termed object ontology. This, combined with a relevance feedback mechanism, allows the qualitative definition of the high-level concepts the user queries for (semantic objects, each represented by a keyword) and the retrieval of relevant video segments. Desired spatial and temporal relationships between the objects in multiple-keyword queries can also be expressed, using the shot ontology. Experimental results of the application of the segmentation algorithm to known sequences demonstrate the efficiency of the proposed segmentation approach. Sample queries reveal the potential of employing this segmentation algorithm as part of an object-based video indexing and retrieval scheme.
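The foreground/background step mentioned in the abstract can be illustrated with a minimal sketch. It assumes one motion vector per macroblock, a least-squares fit of an eight-parameter bilinear model (v = a0 + a1·x + a2·y + a3·x·y for each velocity component), and a fixed residual threshold. The function names, the threshold value, and the exact rejection rule are assumptions made for illustration; the abstract does not specify them, so this is not presented as the authors' implementation.

```python
import numpy as np

def bilinear_design(x, y):
    """Design matrix [1, x, y, x*y] of the bilinear motion model."""
    return np.column_stack([np.ones_like(x), x, y, x * y])

def iterative_rejection(x, y, vx, vy, threshold=1.0, max_iter=20):
    """Split macroblocks into background (fits the global model) and foreground.

    x, y      : macroblock positions (1-D arrays)
    vx, vy    : macroblock motion vector components (1-D arrays)
    threshold : hypothetical residual limit, in motion vector units
    Returns (foreground_mask, coeffs), where coeffs has shape (4, 2):
    column 0 models vx and column 1 models vy.
    """
    x, y = np.asarray(x, dtype=float), np.asarray(y, dtype=float)
    A = bilinear_design(x, y)
    V = np.column_stack([np.asarray(vx, dtype=float),
                         np.asarray(vy, dtype=float)])
    inliers = np.ones(len(x), dtype=bool)
    coeffs = np.zeros((4, 2))
    for _ in range(max_iter):
        if inliers.sum() < 4:
            break  # too few blocks left to fit four parameters per component
        # Fit the model to the blocks currently labeled as background.
        coeffs, *_ = np.linalg.lstsq(A[inliers], V[inliers], rcond=None)
        # Evaluate every block against the fit; blocks may be re-admitted.
        residual = np.linalg.norm(A @ coeffs - V, axis=1)
        new_inliers = residual <= threshold
        if np.array_equal(new_inliers, inliers):
            break  # labels are stable
        inliers = new_inliers
    return ~inliers, coeffs
```

Blocks flagged in the returned mask are foreground candidates. In the pipeline the abstract describes, such candidates would then be checked for temporal consistency, clustered into connected regions, and tracked.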
Pages: 606-621
Page count: 16