INTELLIGENT IMAGE INTERPRETATION FOR HIGH-COMPRESSION HIGH-QUALITY SEQUENCE CODING

被引:1
作者
BEDINI, G
FAVALLI, L
MARAZZI, A
MECOCCI, A
ZANARDI, C
机构
[1] Dipartimento di Elettronica, Università di Pavia, Pavia, 2710
来源
EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS | 1995年 / 6卷 / 03期
关键词
D O I
10.1002/ett.4460060306
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Video transmission at very low bit rate has got growing attention in recent years. In this paper we propose an approach for the compression of 144 x 176 pixels Q-CIF video conference sequences. The compression ratio well exceeds 200 : 1 (thus leading to bit-rates under 10 kbit/s for 10 frames/s) with very good psycovisual quality of the reconstructed images. The algorithm integrates and improves different feature extraction and image coding techniques. At first the speaker's head is detected by means of active snakes. A new form of internal energy is defined that allows a very robust and fast head tracking. After head detection, internal facial features (i.e., eyes, nose, and mouth) are located by means of a new algorithm. The image is decomposed into different parts with different psicovisual relevance. This information is used to guide in an intelligent way the subsequent processing of the motion compensated difference image. The important areas are coded more accurately while the less relevant areas are coded in a coarser way. This approach grants very high compression while the image quality remains high. Subpixel block matching is used to obtain a motion compensated difference image. This image is segmented into homogeneous regions that are then coded by means of a technique based on differential chain code.
引用
收藏
页码:255 / 265
页数:11
相关论文
共 45 条
[1]  
Netravali A.N., Limb J.O., Picture coding: a review, Proc. IEEE, 63, pp. 366-406, (1980)
[2]  
Jain A.K., Image data compression: a review, Proc. IEEE, 69, pp. 349-389, (1981)
[3]  
Jayant N., Signal compression: technology targets and research direction, JSAC, 10, pp. 798-818
[4]  
Ahmed N., Nacarajan T., Rao K., pp. 90-93, (1974)
[5]  
Netravali A.N., Haskell B.G., Digital pictures ‐ representation and compression, (1989)
[6]  
Kunt M., Iconomopulos A., Kocher M., Second generation image‐coding techniques, Proceedings of the IEEE, 73, 4, pp. 549-574, (1985)
[7]  
Burt P.J., Adelson E.H., The laplacian pyramid as a compact image code, IEEE Trans, on Coram., 31, pp. 532-540, (1983)
[8]  
Keener M., Kunt M., pp. 131-139, (1993)
[9]  
Ghravi H., Tabatabai A., Subband coding of monochrom and collar images, IEEE Transactions on Circuits and Systems, 35, pp. 207-214, (1988)
[10]  
Gilge M., Engelhardt T., Mehlan R., Coding of arbitrariliy shaped image segments based on a generalised hortogonal transform, Signal Processing: Image Communication, 1, 2, pp. 153-180, (1989)