Video indexing based on mosaic representations

被引:166
作者
Irani, M [1 ]
Anandan, P [1 ]
机构
[1] Sarnoff Corp, Princeton, NJ 08540 USA
关键词
compact video representations; mosaics; video annotation; video browsing; video compression; video data bases; video indexing; video manipulation;
D O I
10.1109/5.664279
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video is a rich source of information. It provides visual information about scenes. This information is implicitly buried inside the raw video data, however, and is provided with the cost of very high temporal redundancy. While the standard sequential form of video storage is adequate for viewing in a "movie mode," it fails to support rapid access to information of interest that is required in many of the emerging applications of video. This paper presents an approach for efficient access, use, and manipulation of video data. The video data are first transformed from their sequential and redundant frame-based representation, in which the information about the scene is distributed over many frames, to an explicit and compact scene-based representation, to which each frame can be directly related. This compact reorganization of the video data supports nonlinear browsing and efficient indexing to provide rapid access directly to information of interest. This paper describes a new set of methods for indexing into the video sequence based on the scene-based representation. These indexing methods are based on geometric and dynamic information contained in the video. These methods complement the more traditional "content-based indexing" methods, which utilize image-appearance information (namely, color and texture properties) but are considerably simpler to achieve and are highly computationally efficient.
引用
收藏
页码:905 / 921
页数:17
相关论文
共 37 条
[2]  
Aloimonos Y., 1993, ACTIVE PERCEPTION
[3]  
[Anonymous], P EUR C COMP VIS ECC
[4]  
AYER S, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P777, DOI 10.1109/ICCV.1995.466859
[5]   A 3-FRAME ALGORITHM FOR ESTIMATING 2-COMPONENT IMAGE MOTION [J].
BERGEN, JR ;
BURT, PJ ;
HINGORANI, R ;
PELEG, S .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (09) :886-896
[6]  
Burt P. J., 1991, Proceedings of the IEEE Workshop on Visual Motion (Cat. No.91TH0390-5), P187, DOI 10.1109/WVM.1991.212808
[7]  
Darrell T., 1991, Proceedings of the IEEE Workshop on Visual Motion (Cat. No.91TH0390-5), P173, DOI 10.1109/WVM.1991.212810
[8]  
FINKELSTEIN A, 1995, P SIGGRAPH, P277
[9]   QUERY BY IMAGE AND VIDEO CONTENT - THE QBIC SYSTEM [J].
FLICKNER, M ;
SAWHNEY, H ;
NIBLACK, W ;
ASHLEY, J ;
HUANG, Q ;
DOM, B ;
GORKANI, M ;
HAFNER, J ;
LEE, D ;
PETKOVIC, D ;
STEELE, D ;
YANKER, P .
COMPUTER, 1995, 28 (09) :23-32
[10]  
HEEGER DJ, 1997, P SIGGRAPH, P229