Improving the performance of MPEG compatible encoding at low bit rates using adaptive neural networks

被引:6
作者
Doulamis, N [1 ]
Doulamis, A [1 ]
Kollias, S [1 ]
机构
[1] Natl Tech Univ Athens, Elect & Comp Engn Comp Sci Div, GR-15773 Athens, Greece
关键词
D O I
10.1006/rtim.1999.0185
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new approach is presented in this paper for improving the performance of MPEG encoders, especially in videophone or videoconferencing applications, through allocation of a greater number of bits in objects that belong to the foreground of image frames, than in objects that belong to the background. A human face and body detector followed by a neural network classifier are used for foreground/background object extraction. The derived image segmentation is used to modify the rate control of MPEG schemes so as to allocate more bits to foreground objects than to background, while retaining compatibility with MPEG encoders. Experimental results are presented, including image sequences with complex backgrounds, which illustrate the performance of the proposed scheme. Both a subjective image quality improvement and a PSNR increase of about 1.35 db on average have been obtained. (C) 2000 Academic Press.
引用
收藏
页码:327 / 345
页数:19
相关论文
共 32 条
[1]  
[Anonymous], 1988, SELF ORG ASS MEMORY
[2]  
ARGYLE AM, GAZE MUTUAL GAZE
[3]  
ARMITANO RM, 1997, P IEEE INT C AC SPEE, V4, P2685
[4]  
BADIQUE E, 1990, P PICT COD S PCS 90
[5]   MPEG and multimedia communications [J].
Chiariglione, L .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1997, 7 (01) :5-18
[6]   Low bit-rate coding of image sequences using adaptive regions of interest [J].
Doulamis, N ;
Doulamis, A ;
Kalogeras, D ;
Kollias, S .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (08) :928-934
[7]  
DOULAMIS N, 1998, IN PRESS P IEEE INT
[8]   AUTOMATIC FACE LOCATION DETECTION FOR MODEL-ASSISTED RATE CONTROL IN H.261-COMPATIBLE CODING OF VIDEO [J].
ELEFTHERIADIS, A ;
JACQUIN, A .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 1995, 7 (4-6) :435-455
[9]  
FLICKNER M, 1995, IEEE COMPUT, V28, P23, DOI DOI 10.1109/2.410146
[10]  
Freer JA, 1995, 29TH ANNUAL 1995 INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY, PROCEEDINGS, P67, DOI 10.1109/CCST.1995.524735