Audiovisual Speech Synthesis

Cited by: 33
Authors
G. Bailly
M. Bérar
F. Elisei
M. Odisio
Affiliation
[1] Institut de la Communication Parlée UMR CNRS no 5009 INPG/Univ. Stendhal 46,
Keywords
text-to-speech synthesis; audiovisual synthesis; facial animation; talking faces
DOI
10.1023/A:1025700715107
Abstract
This paper presents the main approaches used to synthesize talking faces and describes a handful of them in greater detail. It attempts to distinguish between facial synthesis itself (i.e. the manner in which facial movements are rendered on a computer screen) and the way these movements may be controlled and predicted from phonetic input. The two main synthesis techniques (model-based vs. image-based) are contrasted and illustrated by brief descriptions of the most representative existing systems. The challenging issues that may drive future models (evaluation, data acquisition and modeling) are also discussed and illustrated by our current work at ICP.
Pages: 331-346
Page count: 15