3-D MOTION ESTIMATION IN MODEL-BASED FACIAL IMAGE-CODING

被引:164
作者
LI, HB [1 ]
ROIVAINEN, P [1 ]
FORCHHEIMER, R [1 ]
机构
[1] LINKOPING UNIV,INFORMAT THEORY LAB,S-58183 LINKOPING,SWEDEN
关键词
ADAPTIVE PREDICTION; ANALYSIS-SYNTHESIS TECHNIQUE; COMPUTER GRAPHICS; COMPUTER VISION; FACIAL IMAGE CODING; FEEDBACK TECHNIQUE; MODEL-BASED IMAGE CODING; MOTION TRACKING; NONRIGID MOTION ESTIMATION; 3-D MODELING;
D O I
10.1109/34.216724
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the issue of 3-D motion estimation in model-based facial image coding. A new approach to estimating the motion of the head and the facial expressions is presented and has the following characteristics: 1) An affine nonrigid motion model is set up. The specific knowledge about facial shape and facial expression is formulated by this model in the form of parameters. This affine motion model is especially suitable to such a type of nonrigid motion as facial expressions. 2) Based on the affine model, we present a direct method of estimating the two-view motion parameters. Because this method neither necessitates solving the correspondence problem nor computing optical flow, motion parameters can be simply and reliably recovered. 3) Based on the reasonable assumption that the 3-D motion of the face is almost smooth in the time domain, we propose several approaches to predicting the motion of the next frame. In this way, the temporal motion information existing in the image sequence is fully exploited. With a good motion predictor the error arising from the treatment of motion by a linear method will be reduced. 4) Using a 3-D model, the new approach is characterized by a feedback loop connecting computer vision and computer graphics. Embedding the synthesis techniques into the analysis phase greatly improves the performance of motion estimation. Our simulations and experiments with long image sequences of real-world scenes indicate that the method developed in this paper not only greatly reduces computational complexity but also substantially improves estimation accuracy. The synthesized image sequence using the estimated motion parameters, a 3-D model of the face, and a frame of textured image looks very natural.
引用
收藏
页码:545 / 555
页数:11
相关论文
共 34 条
  • [1] ADVI G, 1989, IEEE T PATTERN ANAL, V11, P477
  • [2] AGGARWAL JK, 1988, AUG P IEEE, P917
  • [3] AIZAWA K, 1987, NOV P GLOBECOM 87, P45
  • [4] ESTIMATION OF OBJECT MOTION PARAMETERS FROM NOISY IMAGES
    BROIDA, TJ
    CHELLAPPA, R
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (01) : 90 - 99
  • [5] BUSCH H, 1989, 3RD P IEE INT C IM P
  • [6] SHAPE AND MOTION OF NONRIGID BODIES
    CHEN, SS
    PENNA, M
    [J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1986, 36 (2-3): : 175 - 207
  • [7] Ekman P, 1978, ENVIRON PSYCH NONVER
  • [8] FORCHHEIMER R, 1989, IEEE T ACOUST SP DEC, V37
  • [9] FORCHHEIMER R, 1983, MAR P PICT COD S PCS, P113
  • [10] FORCHHEIMER R, 1987, JUN P PICT COD S PCS, P171