A fuzzy video content representation for video summarization and content-based retrieval

被引：95

作者：

Doulamis, AD ^{[1
]}

Doulamis, ND ^{[1
]}

Kollias, SD ^{[1
]}

机构：

[1] Natl Tech Univ Athens, Dept Elect & Comp Engn, Zografos 15773, Greece

来源：

SIGNAL PROCESSING | 2000年 / 80卷 / 06期

关键词：

video summarization; content-based retrieval; fuzzy logic; genetic algorithms;

D O I：

10.1016/S0165-1684(00)00019-0

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, a fuzzy representation of visual content is proposed, which is useful for the new emerging multimedia applications, such as content-based image indexing and retrieval, video browsing and summarization. In particular, a multidimensional fuzzy histogram is constructed for each video frame based on a collection of appropriate features, extracted using video sequence analysis techniques. This approach is then applied both for video summarization, in the context of a content-based sampling algorithm, and for content-based indexing and retrieval. In the first case, video summarization is accomplished by discarding shots or frames of similar visual content so that only a small but meaningful amount of information is retained (key-frames). In the second case, a content-based retrieval scheme is investigated, so that the most similar images to a query are extracted. Experimental results and comparison with other known methods are presented to indicate the good performance of the proposed scheme on real-life video recordings. (C) 2000 Elsevier Science B.V. All rights reserved.

引用

页码：1049 / 1067

页数：19

共 24 条

[1] Image sequence analysis for emerging interactive multimedia services - The European COST 211 framework [J].

Alatan, AA ;

Onural, L ;

Wollborn, M ;

Mech, R ;

Tuncel, E ;

Sikora, T .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (07) :802-813

[2]

[Anonymous], 1992, NEURAL NETWORKS FUZZ

[3]

AVRITHIS Y, 1998, P WORK VER LOW BIT R

[4] A stochastic framework for optimal key frame extraction from MPEG video databases [J].

Avrithis, YS ;

Doulamis, AD ;

Doulamis, ND ;

Kollias, SD .

COMPUTER VISION AND IMAGE UNDERSTANDING, 1999, 75 (1-2) :3-24

[5] A fully automated content-based video search engine supporting spatiotemporal queries [J].

Chang, SF ;

Chen, W ;

Meng, HJ ;

Sundaram, H ;

Zhong, D .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) :602-615

[6] Visual image retrieval by elastic matching of user sketches [J].

DelBimbo, A ;

Pala, P .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (02) :121-132

[7] On-line retrainable neural networks: Improving the performance of neural networks in image analysis problems [J].

Doulamis, AD ;

Doulamis, ND ;

Kollias, SD .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (01) :137-155

[8] Low bit-rate coding of image sequences using adaptive regions of interest [J].

Doulamis, N ;

Doulamis, A ;

Kalogeras, D ;

Kollias, S .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (08) :928-934

[9]

FLICKNER M, 1995, IEEE COMPUT, V28, P23, DOI DOI 10.1109/2.410146

[10]

GARRIDO L, 1997, P WORKSH IM AN MULT, P13

← 1 2 3 →