QUALITY OF SPEECH PRODUCED BY ANALYSIS-SYNTHESIS

Cited by: 12
Authors
CHILDERS, DG
WU, K
Affiliation
[1] Dept. of Electrical Engineering, University of Florida, Gainesville
Keywords
formant synthesizer; linear prediction synthesizer; quality; speech; synthesis
DOI
10.1016/0167-6393(90)90064-G
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
We review factors that have affected the synthesis of high-quality speech by analysis-synthesis. The influence of a selected subset of these factors on the quality of synthesized speech was evaluated through listener preference judgements by comparing natural speech to the synthetic speech of two synthesizers: linear prediction coding (LPC) and formant. Several synthesizer excitation waveforms were considered. These waveforms included critical parameters that replicated selected glottal timing events, e.g., the instants of glottal closure and glottal opening. In addition, identifying voiced/unvoiced/mixed excitation and silent intervals in the speech waveform and measuring the fundamental frequency of voicing contributed to the synthesis of high-quality speech. A two-channel approach to speech analysis is recommended to aid the automatic processing of speech, where one channel is the conventional acoustic signal, while the other channel is the electroglottogram (EGG). © 1990.
Pages: 97-117
Page count: 21