Temporal Bayesian Network based contextual framework for structured information mining

被引:2
作者
Mittal, Ankush [1 ]
Pagalthivarthi, Krishnan V.
机构
[1] Indian Inst Technol, Dept Elect & Comp Engn, Roorkee 247667, Uttar Pradesh, India
[2] Indian Inst Technol, Dept Appl Mech, New Delhi 110016, India
关键词
time-to-collision; cut types; contextual cues; dynamic Bayesian networks; semantic features;
D O I
10.1016/j.patrec.2006.12.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Specific domains in video data contain rich temporal structures that help in classification process. In this paper, we exploit the temporal structure to characterize video sequence data into different classes. We propose the following perceptual features: Time-to-Collision, shot length and transition, and temporal motion activity. Using these perceptual features, several video classes are characterized leading to formation of high-level sequence classification. Resulting high-level queries are more easily mapped onto the perceptual features enabling better accessibility of content-based retrieval systems. Temporal fusion of the perceptual features forms higher-level structures, which can be effectively tackled using the Dynamic Bayesian Networks. The Networks allow the power of statistical inference and learning to be combined with the temporal and contextual knowledge of the problem. The modeling and experimental results are presented for a number of key applications, like sequence identification, extracting highlights for sports, and parsing a news program. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1873 / 1884
页数:12
相关论文
共 41 条
[1]  
AIGRAIN P, 1996, INTELLIGENT MUTLIMED, P159
[2]  
Alattar A. M., 1998, ISCAS '98. Proceedings of the 1998 IEEE International Symposium on Circuits and Systems (Cat. No.98CH36187), P249, DOI 10.1109/ISCAS.1998.698806
[3]  
[Anonymous], THESIS U ILLINOIS UR
[4]  
[Anonymous], 1982, Visual perception: Essential readings
[5]  
ARDIZZONE E, 1996, P INT C PATT REC ICP, V3, P135
[6]  
BALAZS B, 1952, THEORY FILM, P118
[7]   Filmic space-time diagrams for video structure representation [J].
Butler, S ;
Parkes, AP .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 1996, 8 (04) :269-280
[8]  
Dempster AP., 1977, MAXIMUM LIKELIHOOD I, P1
[9]  
FABLET R, 1999, P 3 INT C VIS INF SY, P221
[10]  
GARG A, 2000, IEEE C AUT FAC GEST, P384