Generalized B pictures and the draft H.264/AVC video-com-pression standard

被引：104

作者：

Flierl, M ^{[1
]}

Girod, B ^{[1
]}

机构：

[1] Stanford Univ, Informat Syst Lab, Stanford, CA 94305 USA

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2003年 / 13卷 / 07期

关键词：

B pictures; motion-compensated prediction; multiframe prediction; multihypothesis motion-compensated prediction; temporal scalability; video coding;

D O I：

10.1109/TCSVT.2003.814963

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper reviews recent advances in using B pictures in the context of the draft H.264/AVC video-compression standard. We focus on reference picture selection and linearly combined motion-compensated prediction signals. We show that bidirectional prediction exploits partially the efficiency of combined prediction signals whereas multihypothesis prediction allows a more general form of B pictures. The general concept of linearly combined prediction signals chosen from an arbitrary set of reference pictures improves the H.264/AVC test model TML-9 which is used in the following. We outline H.264/AVC macroblock prediction modes for B pictures, classify them into four groups and compare their efficiency in terms of rate-distortion performance. When investigating multihypothesis prediction, we show that bidirectional prediction is a special case of this concept. Multihypothesis prediction allows also two combined forward prediction signals. Experimental results show that this case is also advantageous in terms of compression efficiency. The draft H. 264/AVC video-compression standard offers improved entropy coding by context-based adaptive binary arithmetic coding. Simulations show that the gains by multihypothesis prediction and arithmetic coding are additive. B pictures establish an enhancement layer and are predicted from reference pictures that are provided by the base layer. The quality of the base layer influences the rate-distortion trade-off for B pictures. We demonstrate how the quality of the B pictures should be reduced to improve the overall rate-distortion performance of the scalable representation.

引用

页码：587 / 597

页数：11

共 28 条

[1] FIXED AND ADAPTIVE PREDICTORS FOR HYBRID PREDICTIVE TRANSFORM CODING [J].

ERICSSON, S .

IEEE TRANSACTIONS ON COMMUNICATIONS, 1985, 33 (12) :1291-1302

[2] Multihypothesis motion estimation for video coding [J].

Flierl, M ;

Girod, B .

DCC 2001: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2001, :341-350

[3] Rate-constrained multi-hypothesis motion-compensated prediction for video coding [J].

Flierl, M ;

Wiegand, T ;

Girod, B .

2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, :150-153

[4] A video codec incorporating block-based multi-hypothesis motion-compensated prediction [J].

Flierl, M ;

Wiegand, T .

VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2000, PTS 1-3, 2000, 4067 :238-249

[5]

FLIERL M, 2001, FURTHER INVESTIGATIO

[6]

Flierl M., 2001, P PICT COD S SEOUL K, P195

[7]

FLIERL M, 2001, P IEEE INT C IM PROC, V3, P526

[8] Efficiency analysis of multihypothesis motion-compensated prediction for video coding [J].

Girod, B .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (02) :173-183

[9]

*ISO IEC, 1996, 138182 ISOIEC

[10]

JEON B, 2001, MODE DECISION B PICT

← 1 2 3 →