EXPERIMENTS WITH VOICE MODELING IN SPEECH SYNTHESIS

被引：27

作者：

CARLSON, R

GRANSTROM, B

KARLSSON, I

机构：

[1] Department of Speech Communication and Music Acoustics, Royal Institute of Technology, S-10044 Stockholm

来源：

SPEECH COMMUNICATION | 1991年 / 10卷 / 5-6期

关键词：

SPEECH SYNTHESIS; VOICE MODELING; VOICE SOURCE MODELS; TEXT-TO-SPEECH CONVERSION; FEMALE VOICE;

D O I：

10.1016/0167-6393(91)90051-T

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Some experiments with voice modelling using recent developments of the KTH speech synthesis system will be presented. A new synthesizer, GLOVE, an extended version of OVE III has been implemented in the system. It contains an improved glottal source built on the LF voice source model, some extra control parameters for the voiced and noise sources and an extra pole/zero-pair in the nasal branch. Furthermore, the present research versions of the KTH text-to-speech system include possibilities for interactive manipulations at the parameter level with on-screen reference to natural speech. The synthesis system constitutes a flexible environment for voice modelling experiments. The new synthesis tools and models were used for synthesis-by-analysis experiments. A sentence uttered by a female speaker was analysed and a stylized copy was made using both the old and the new synthesis system. With the new system the synthetic copy sounded very similar to the natural utterance.

引用

页码：481 / 489

页数：9

共 16 条

[1]

BLADON A, 1987, SEP P EUR C SPEECH T, V1, P55

[2]

CARLSON R, 1990, APR P INT C AC SPEEC, P317

[3]

CARLSON R, 1989, SEP P ESCA WORKSH SP

[4]

CARLSON R, 1989, MAY P INT C AC SPEEC, V1, P223

[5]

CARLSON R, 1990, ADV SPEECH HEARING L, P269

[6]

Fant G., 1985, STL QPSR, V26, P1, DOI DOI 10.1016/0167-6393(89)90001-0

[7]

FANT G, 1990, JUN P TUT RES WORKSH, P106

[8]

GOBL C, 1989, IN PRESS P VOCAL FOL

[9]

Gobl C., 1988, STL QPSR, V29, P123

[10]

Karlsson I., 1988, 7th FASE Symposium. Proceedings Speech '88, P225

← 1 2 →