A generic video parsing system with a scene description language (SDL)

被引:8
作者
Gong, YH [1 ]
Chuan, CH [1 ]
Zhu, YW [1 ]
Sakauchi, M [1 ]
机构
[1] UNIV TOKYO,INST IND SCI,MINATO KU,TOKYO 106,JAPAN
关键词
D O I
10.1006/rtim.1996.0005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Techniques for automatic video parsing and annotation are crucial to turn enormous volumes of video data into a rich and structured data type, and to facilitate video content-based search and retrieval, In this paper, we present a generic video parser with a Scene Description Language (SDL), The SDL enables the human operator to model a video clip in terms of a relatively high abstract level. The video parser is equipped with various algorithms that are common and essential to general video analyses, To handle the video domain with virtually unlimited sets of unanticipated and variable objects and events efficiently, an object-orientated, processing-on-demand approach is devised to perform the video parsing, The video parser first interprets the video model defined by the operator, identifies the prominent video properties to be parsed, and then creates an entity for each of the video properties, Each entity knows how to find a match for itself from the video properties extracted from the video image, The video parser interacts with these entities, and performs the feature extraction operations with processing-on-demand basis. Each entity has a self-diagnostic function that is able to turn itself into an inert state when it fails to find the necessary matches during the video parsing process. The inert entities will be excluded from subsequent operations, and will no longer consume any system resources, Our experiments have shown that our generic video parser is effective and efficient in handling a large variety of video images. (C) 1996 Academic Press Limited
引用
收藏
页码:45 / 59
页数:15
相关论文
共 8 条
[1]  
GONG YH, 1995, CVGIP IMAG UNDERSTAN, V61
[2]  
GONG YH, 1995, IEEE INT C MULT COMP
[3]  
INTILLE SS, 1994, 296 MIT
[4]  
NAGASAKA A, 1991, P 2 WORK C VIS DAT S, P119
[5]  
SWANBERG D, 1993, P IS T SPIES S EL IM
[6]  
UEDA H, P CHI 91 NEW ORL LA, P343
[7]  
ZHANG HJ, 1993, ACM MULTIMEDIA SYSTE, V1, P10
[8]  
ZHANG HJ, 1994, IEEE INT C MULT COMP