A fuzzy video content representation for video summarization and content-based retrieval

被引:95
作者
Doulamis, AD [1 ]
Doulamis, ND [1 ]
Kollias, SD [1 ]
机构
[1] Natl Tech Univ Athens, Dept Elect & Comp Engn, Zografos 15773, Greece
关键词
video summarization; content-based retrieval; fuzzy logic; genetic algorithms;
D O I
10.1016/S0165-1684(00)00019-0
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a fuzzy representation of visual content is proposed, which is useful for the new emerging multimedia applications, such as content-based image indexing and retrieval, video browsing and summarization. In particular, a multidimensional fuzzy histogram is constructed for each video frame based on a collection of appropriate features, extracted using video sequence analysis techniques. This approach is then applied both for video summarization, in the context of a content-based sampling algorithm, and for content-based indexing and retrieval. In the first case, video summarization is accomplished by discarding shots or frames of similar visual content so that only a small but meaningful amount of information is retained (key-frames). In the second case, a content-based retrieval scheme is investigated, so that the most similar images to a query are extracted. Experimental results and comparison with other known methods are presented to indicate the good performance of the proposed scheme on real-life video recordings. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:1049 / 1067
页数:19
相关论文
共 24 条
[1]   Image sequence analysis for emerging interactive multimedia services - The European COST 211 framework [J].
Alatan, AA ;
Onural, L ;
Wollborn, M ;
Mech, R ;
Tuncel, E ;
Sikora, T .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (07) :802-813
[2]  
[Anonymous], 1992, NEURAL NETWORKS FUZZ
[3]  
AVRITHIS Y, 1998, P WORK VER LOW BIT R
[4]   A stochastic framework for optimal key frame extraction from MPEG video databases [J].
Avrithis, YS ;
Doulamis, AD ;
Doulamis, ND ;
Kollias, SD .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1999, 75 (1-2) :3-24
[5]   A fully automated content-based video search engine supporting spatiotemporal queries [J].
Chang, SF ;
Chen, W ;
Meng, HJ ;
Sundaram, H ;
Zhong, D .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) :602-615
[6]   Visual image retrieval by elastic matching of user sketches [J].
DelBimbo, A ;
Pala, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (02) :121-132
[7]   On-line retrainable neural networks: Improving the performance of neural networks in image analysis problems [J].
Doulamis, AD ;
Doulamis, ND ;
Kollias, SD .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (01) :137-155
[8]   Low bit-rate coding of image sequences using adaptive regions of interest [J].
Doulamis, N ;
Doulamis, A ;
Kalogeras, D ;
Kollias, S .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (08) :928-934
[9]  
FLICKNER M, 1995, IEEE COMPUT, V28, P23, DOI DOI 10.1109/2.410146
[10]  
GARRIDO L, 1997, P WORKSH IM AN MULT, P13