MultiView: Multilevel video content representation and retrieval

被引:31
作者
Fan, JP [1 ]
Aref, WG
Elmagarmid, AK
Hacid, MS
Marzouk, MS
Zhu, XQ
机构
[1] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[3] Univ Lyon 1, LISI, UFR Informat, F-69622 Villeurbanne, France
基金
美国国家科学基金会;
关键词
D O I
10.1117/1.1406944
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this article, several practical algorithms are proposed to support content-based video analysis, modeling, representation, summarization, indexing, and access. First, a multilevel video database model is given. One advantage of this model is that it provides a reasonable approach to bridging the gap between low-level representative features and high-level semantic concepts from a human point of view. Second, several model-based video analysis techniques are proposed. In order to detect the video shots, we present a novel technique, which can adapt the threshold for scene cut detection to the activities of variant videos or even different video shots. A seeded region aggregation and temporal tracking technique is proposed for generating the semantic video objects. The semantic video scenes can then be generated from these extracted video access units (e.g., shots and objects) according to some domain knowledge. Third, in order to categorize video contents into a set of semantic clusters, an integrated video classification technique is developed to support more efficient multilevel video representation, summarization, indexing, and access techniques. (C) 2001 SPIE and IS&T.
引用
收藏
页码:895 / 908
页数:14
相关论文
共 52 条
[1]   Image sequence analysis for emerging interactive multimedia services - The European COST 211 framework [J].
Alatan, AA ;
Onural, L ;
Wollborn, M ;
Mech, R ;
Tuncel, E ;
Sikora, T .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (07) :802-813
[2]  
[Anonymous], 1999, 7 ACM INT C MULT, DOI DOI 10.1145/319878.319896
[3]  
Berchtold S, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P28
[4]  
BERITEZ A, 2000, SIGNAL PROCESS-IMAGE, V16, P235
[5]  
CARSON C, 1999, P INT C VIS INF SYST
[6]   A fully automated content-based video search engine supporting spatiotemporal queries [J].
Chang, SF ;
Chen, W ;
Meng, HJ ;
Sundaram, H ;
Zhong, D .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) :602-615
[7]   High-efficiency Nd:YVO4 laser end-pumped with a diode laser bar [J].
Chen, YH ;
Xiong, ZJ ;
Lim, GC ;
Zheng, HY ;
Peng, XY .
PHOTONIC SYSTEMS AND APPLICATIONS IN DEFENSE AND MANUFACTURING, 1999, 3898 :148-155
[8]   Automatic video indexing via object motion analysis [J].
Courtney, JD .
PATTERN RECOGNITION, 1997, 30 (04) :607-625
[9]   The Bayesian image retrieval system, PicHunter:: Theory, implementation, and psychophysical experiments [J].
Cox, IJ ;
Miller, ML ;
Minka, TP ;
Papathomas, TV ;
Yianilos, PN .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (01) :20-37
[10]   NeTra-V: Toward an object-based video representation [J].
Deng, YN ;
Manjunath, BS .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) :616-627