Efficient representations of video sequences and their applications

Cited by: 129
Authors
Irani, M
Anandan, P
Bergen, J
Kumar, R
Hsu, S
Affiliation
[1] David Sarnoff Research Center, CN5300, Princeton
Keywords
video representation; mosaic images; motion analysis; image registration; video databases; video compression; video enhancement; video visualization; video indexing; video manipulation;
DOI
10.1016/0923-5965(95)00055-0
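Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract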
Recently, there has been a growing interest in the use of mosaic images to represent the information contained in video sequences. This paper systematically investigates how to go beyond thinking of the mosaic simply as a visualization device, and to treat it instead as a basis for an efficient and complete representation of video sequences. We describe two different types of mosaics, called the static and the dynamic mosaics, that are suitable for different needs and scenarios. These two types of mosaics are unified and generalized in a mosaic representation called the temporal pyramid. To handle sequences containing large variations in image resolution, we develop a multiresolution mosaic. We discuss a series of increasingly complex alignment transformations (ranging from 2D to 3D and layers) for making the mosaics. We describe techniques for the basic elements of the mosaic construction process, namely sequence alignment, sequence integration into a mosaic image, and residual analysis to represent information not captured by the mosaic image. We describe several powerful video applications of mosaic representations including video compression, video enhancement, enhanced visualization, and other applications in video indexing, search, and manipulation.
Pages: 327-351
Page count: 25