Rate-constrained multihypothesis prediction for motion-compensated video compression

Cited by: 52
Authors
Flierl, M [1]
Wiegand, T
Girod, B
Affiliations
[1] Stanford Univ, Informat Syst Lab, Stanford, CA 94305 USA
[2] Heinrich Hertz Inst Nachrichtentech Berlin GmbH, Image Proc Dept, D-10587 Berlin, Germany
Keywords
entropy-constrained vector quantization; linear prediction; motion-compensated prediction; multiframe prediction; multihypothesis motion-compensated prediction; rate-constrained motion estimation; video coding
DOI
10.1109/TCSVT.2002.805490
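Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract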
This paper investigates linearly combined motion-compensated signals for video compression. In particular, we discuss multiple motion-compensated signals that are jointly estimated for efficient prediction and video coding. First, we extend the wide-sense stationary theory of motion-compensated prediction (MCP) to the case of jointly estimated prediction signals. Our theory suggests that the gain from multihypothesis MCP is limited and that two jointly estimated hypotheses provide a major portion of this achievable gain. In addition, the analysis reveals a property of the displacement error of jointly estimated hypotheses. Second, we present a complete multihypothesis codec which is based on the ITU-T Recommendation H.263 with multiframe capability. Multiframe motion compensation chooses one prediction signal from a set of reference frames, whereas multihypothesis prediction chooses more than one for the linear combination. With our scheme, the time delay associated with B-frames is avoided by choosing more than one prediction signal from previously decoded pictures. Experimental results show that multihypothesis prediction significantly improves coding efficiency by utilizing variable block size and multiframe motion compensation. We show that variable block size and multihypothesis prediction provide gains for different scenarios and that multiframe motion compensation enhances the multihypothesis gain. For example, the presented multihypothesis codec with ten reference frames improves coding efficiency by up to 2.7 dB when compared to the reference codec with one reference frame for the set of investigated test sequences.
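The distinction the abstract draws between multiframe and multihypothesis prediction can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the function names, the integer-pel displacements, the equal weighting of hypotheses, and the toy frames. With one hypothesis the function reduces to picking a single block from one reference frame; with two or more it forms a linear combination of the selected blocks.

```python
# Illustrative sketch only. Names, equal weights, and integer-pel
# displacements are assumptions, not the codec described in the paper.

def fetch_block(frame, top, left, size):
    """Return the size x size block of `frame` with top-left corner (top, left)."""
    return [row[left:left + size] for row in frame[top:top + size]]


def multihypothesis_prediction(ref_frames, hypotheses, top, left, size):
    """Average the motion-compensated blocks selected by `hypotheses`.

    Each hypothesis is (frame_index, dy, dx): a previously decoded
    reference frame and an integer displacement. A single hypothesis
    corresponds to multiframe prediction; several hypotheses are
    combined linearly with equal weights.
    """
    blocks = [
        fetch_block(ref_frames[idx], top + dy, left + dx, size)
        for idx, dy, dx in hypotheses
    ]
    n = len(blocks)
    return [
        [sum(b[r][c] for b in blocks) / n for c in range(size)]
        for r in range(size)
    ]


def sse(block_a, block_b):
    """Sum of squared errors between two equally sized blocks."""
    return sum(
        (a - b) ** 2
        for row_a, row_b in zip(block_a, block_b)
        for a, b in zip(row_a, row_b)
    )


# Toy data: the two reference frames deviate from the target in opposite directions.
ref0 = [[12] * 4 for _ in range(4)]
ref1 = [[8] * 4 for _ in range(4)]
target = [[10, 10], [10, 10]]

single = multihypothesis_prediction([ref0, ref1], [(0, 0, 0)], 1, 1, 2)
pair = multihypothesis_prediction([ref0, ref1], [(0, 0, 0), (1, 0, 1)], 1, 1, 2)
print(sse(single, target), sse(pair, target))
```

In this contrived case the errors of the two hypotheses cancel when averaged, so the combined prediction has a lower squared error than either block alone. Real sequences would not cancel this cleanly, and a rate-constrained encoder would also weigh the extra bits needed to signal the additional hypothesis.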
Pages: 957-969
Page count: 13