3-D MOTION ESTIMATION IN MODEL-BASED FACIAL IMAGE-CODING

被引：164

作者：

LI, HB ^{[1
]}

ROIVAINEN, P ^{[1
]}

FORCHHEIMER, R ^{[1
]}

机构：

[1] LINKOPING UNIV,INFORMAT THEORY LAB,S-58183 LINKOPING,SWEDEN

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 1993年 / 15卷 / 06期

关键词：

ADAPTIVE PREDICTION; ANALYSIS-SYNTHESIS TECHNIQUE; COMPUTER GRAPHICS; COMPUTER VISION; FACIAL IMAGE CODING; FEEDBACK TECHNIQUE; MODEL-BASED IMAGE CODING; MOTION TRACKING; NONRIGID MOTION ESTIMATION; 3-D MODELING;

D O I：

10.1109/34.216724

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the issue of 3-D motion estimation in model-based facial image coding. A new approach to estimating the motion of the head and the facial expressions is presented and has the following characteristics: 1) An affine nonrigid motion model is set up. The specific knowledge about facial shape and facial expression is formulated by this model in the form of parameters. This affine motion model is especially suitable to such a type of nonrigid motion as facial expressions. 2) Based on the affine model, we present a direct method of estimating the two-view motion parameters. Because this method neither necessitates solving the correspondence problem nor computing optical flow, motion parameters can be simply and reliably recovered. 3) Based on the reasonable assumption that the 3-D motion of the face is almost smooth in the time domain, we propose several approaches to predicting the motion of the next frame. In this way, the temporal motion information existing in the image sequence is fully exploited. With a good motion predictor the error arising from the treatment of motion by a linear method will be reduced. 4) Using a 3-D model, the new approach is characterized by a feedback loop connecting computer vision and computer graphics. Embedding the synthesis techniques into the analysis phase greatly improves the performance of motion estimation. Our simulations and experiments with long image sequences of real-world scenes indicate that the method developed in this paper not only greatly reduces computational complexity but also substantially improves estimation accuracy. The synthesized image sequence using the estimated motion parameters, a 3-D model of the face, and a frame of textured image looks very natural.

引用

页码：545 / 555

页数：11

共 34 条

[1] ADVI G, 1989, IEEE T PATTERN ANAL, V11, P477
[2] AGGARWAL JK, 1988, AUG P IEEE, P917
[3] AIZAWA K, 1987, NOV P GLOBECOM 87, P45
[4] ESTIMATION OF OBJECT MOTION PARAMETERS FROM NOISY IMAGES
BROIDA, TJ
CHELLAPPA, R
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (01) : 90 - 99
[5] BUSCH H, 1989, 3RD P IEE INT C IM P
[6] SHAPE AND MOTION OF NONRIGID BODIES
CHEN, SS
PENNA, M
[J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1986, 36 (2-3): : 175 - 207
[7] Ekman P, 1978, ENVIRON PSYCH NONVER
[8] FORCHHEIMER R, 1989, IEEE T ACOUST SP DEC, V37
[9] FORCHHEIMER R, 1983, MAR P PICT COD S PCS, P113
[10] FORCHHEIMER R, 1987, JUN P PICT COD S PCS, P171

← 1 2 3 4 →