Motion analysis in 3D DCT domain and its application to video coding

被引：24

作者：

Bozinovic, N ^{[1
]}

Konrad, J ^{[1
]}

机构：

[1] Boston Univ, Dept Elect & Comp Engn, Boston, MA 02215 USA

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2005年 / 20卷 / 06期

关键词：

motion analysis; discrete cosine transform; DCT; video coding; 3D transform coding; coefficient quantization; coefficient scanning;

D O I：

10.1016/j.image.2005.03.007

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Global, constant-velocity, translational motion in an image sequence induces a characteristic energy footprint in the Fourier-transform (FT) domain; spectrum is limited to a plane with orientation defined by the direction of motion. By detecting these spectral occupancy planes, methods have been proposed to estimate such global motion. Since the discrete cosine transform (DCT) is a ubiquitous tool of all video compression standards to date, we investigate in this paper properties of motion in the DCT domain. We show that global, constant-velocity, translational motion in an image sequence induces in the DCT domain spectral occupancy planes, similarly to the FT domain. Unlike in the FT case, however, these planes are subject to spectral folding. Based on this analysis, we propose a motion estimation method in the DCT domain, and we show that results comparable to standard block matching can be obtained. Moreover, by realizing that significant energy in the DCT domain concentrates around a folded plane, we propose a new approach to video compression. The approach is based on 3D DCT applied to a group of frames, followed by motion-adaptive scanning of DCT coefficients (akin to "zig-zag" scanning in MPEG coders), their adaptive quantization, and final entropy coding. We discuss the design of the complete 3D DCT coder and we carry out a performance comparison of the new coder with ubiquitous hybrid coders. (c) 2005 Elsevier B.V. All rights reserved.

引用

页码：510 / 528

页数：19

共 22 条

[1] DISCRETE COSINE TRANSFORM [J].

AHMED, N ;

NATARAJAN, T ;

RAO, KR .

IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (01) :90-93

[2]

BJONTEGAARD G, P VCEG 13 M AUST TX

[3]

BOZINOVIC N, 2003, P IS T SPIE S IM VID, P1204

[4] Motion-compensated 3-D subband coding of video [J].

Choi, SJ ;

Woods, JW .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 1999, 8 (02) :155-167

[5] THE SAMPLING AND RECONSTRUCTION OF TIME-VARYING IMAGERY WITH APPLICATION IN VIDEO SYSTEMS [J].

DUBOIS, E .

PROCEEDINGS OF THE IEEE, 1985, 73 (04) :502-522

[6]

HEEGER DJ, 1987, INT J COMPUT VISION, V1, P279, DOI 10.1007/BF00133568

[7] DERIVATION OF OPTICAL-FLOW USING A SPATIOTEMPORAL-FREQUENCY APPROACH [J].

JACOBSON, L ;

WECHSLER, H .

COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1987, 38 (01) :29-65

[8]

KARLSSON G, 1988, P IEEE INT C AC SPEE, V2, P1100

[9] An embedded wavelet video coder using three-dimensional set partitioning in hierarchical trees (SPIHT) [J].

Kim, BJ ;

Pearlman, WA .

DCC '97 : DATA COMPRESSION CONFERENCE, PROCEEDINGS, 1997, :251-260

[10]

KONRAD J, 2002, P IEEE INT C IM PROC, V2, P281

← 1 2 3 →