Object-based multimedia content description schemes and applications for MPEG-7

Cited by: 13
Authors
Benitez, AB
Paek, S
Chang, SF
Puri, A
Huang, Q
Smith, JR
Li, CS
Bergman, LD
Judice, CN
Affiliations
[1] Columbia Univ, Dept Elect Engn, Image & Adv Tel Lab, New York, NY 10027 USA
[2] AT&T Labs Res, Red Bank, NJ 07701 USA
[3] IBM Corp, TJ Watson Res Ctr, Hawthorne, NY 10532 USA
[4] Eastman Kodak, Rochester, NY 14653 USA
Funding
National Aeronautics and Space Administration (NASA); National Science Foundation (NSF);
Keywords
MPEG-7; multimedia description scheme; multimedia representation; object-based description; multimedia; image; video; home media; archive;
DOI
10.1016/S0923-5965(00)00030-8
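Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code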
0808 ; 0809 ;
Abstract
In this paper, we describe description schemes (DSs) for image, video, multimedia, home media, and archive content proposed to the MPEG-7 standard. MPEG-7 aims to create a multimedia content description standard in order to facilitate various multimedia searching and filtering applications. During the design process, special care was taken to provide simple but powerful structures that represent generic multimedia data. We use the extensible markup language (XML) to illustrate and exemplify the proposed DSs because of its interoperability and flexibility advantages. The main components of the image, video, and multimedia description schemes are object, feature classification, object hierarchy, entity-relation graph, code downloading, multi-abstraction levels, and modality transcoding. The home media description instantiates the former DSs, proposing the 6-W semantic features for objects, and 1-P physical and 6-W semantic object hierarchies. The archive description scheme aims to describe collections of multimedia documents, whereas the former DSs only aim at individual multimedia documents. In the archive description scheme, the content of an archive is represented using multiple hierarchies of clusters, which may be related by entity-relation graphs. The hierarchy is a specific case of the entity-relation graph that uses a containment relation. We explicitly include the hierarchy structure in our DSs because it is a natural way of defining composite objects, a more efficient structure for retrieval, and the representation structure used in MPEG-4. We demonstrate the feasibility and the efficiency of our description schemes by presenting applications that already use the proposed structures or will greatly benefit from their use. These applications are the visual apprentice, the AMOS-search system, a multimedia broadcast news browser, a storytelling system, and an image meta-search engine, MetaSEEk. © 2000 Elsevier Science B.V. All rights reserved.
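The abstract's remark that a hierarchy is a special case of an entity-relation graph restricted to a containment relation can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration only: the class `EntityRelationGraph`, the relation labels `contains` and `left_of`, and the XML element and attribute names `object_node` and `object_ref` are hypothetical and are not taken from the paper or from the MPEG-7 schema. The sketch also assumes the containment relation is acyclic.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field


@dataclass
class EntityRelationGraph:
    """Objects plus typed, directed relations between them (hypothetical model)."""
    objects: set = field(default_factory=set)
    relations: list = field(default_factory=list)  # (source, relation_type, target)

    def relate(self, source, relation_type, target):
        """Record a directed relation and register both endpoints as objects."""
        self.objects.update((source, target))
        self.relations.append((source, relation_type, target))

    def children(self, parent, relation_type="contains"):
        """Targets reachable from `parent` through one relation of the given type."""
        return [t for s, r, t in self.relations if s == parent and r == relation_type]


def hierarchy_to_xml(graph, root_id):
    """Render only the containment relations as a nested XML tree.

    Assumes containment is acyclic; element names are illustrative only.
    """
    node = ET.Element("object_node", {"object_ref": root_id})
    for child in graph.children(root_id):
        node.append(hierarchy_to_xml(graph, child))
    return node


graph = EntityRelationGraph()
graph.relate("image", "contains", "person")
graph.relate("image", "contains", "car")
graph.relate("person", "contains", "face")
graph.relate("person", "left_of", "car")  # non-containment relation, kept in the graph only

xml_text = ET.tostring(hierarchy_to_xml(graph, "image"), encoding="unicode")
print(xml_text)
```

In this sketch the spatial relation `left_of` stays in the graph but never appears in the hierarchy, which is built by following `contains` edges alone. That is one way to read the abstract's distinction between the general graph and the explicit hierarchy structure.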
Pages: 235-269
Number of pages: 35