QUALITY OF SPEECH PRODUCED BY ANALYSIS-SYNTHESIS

被引:12
作者
CHILDERS, DG
WU, K
机构
[1] Dept. of Electrical Engineering, University of Florida, Gainesville
关键词
formant synthesizer; linear prediction synthesizer; quality; Speech; synthesis;
D O I
10.1016/0167-6393(90)90064-G
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We review factors that have affected the synthesis of high-quality speech by analysis-synthesis. The influence of a selected subset of these factors on the quality of synthesized speech was evaluated through listener preference judgements by comparing natural speech to the synthetic speech of two synthesizers: linear prediction coding (LPC) and formant. Several synthesizer excitation waveforms were considered. These waveforms included critical parameters that replicated selected glottal timing events, e.g., the instants of glottal closure and glottal opening. In addition, identifying voiced/unvoiced/mixed excitation and silent intervals in the speech waveform and measuring the fundamental frequency of voicing contributed to the synthesis of high-quality speech. A two-channel approach to speech analysis is recommended to aid the automatic processing of speech, where one channel is the conventional acoustic signal, while the other channel is the electroglottogram (EGG). © 1990.
引用
收藏
页码:97 / 117
页数:21
相关论文
共 121 条
[51]  
ITOH K, 1984, REV ELEC COMMUN LAB, V32, P220
[52]  
Jayant N. S., 1984, DIGITAL CODING WAVEF
[53]   DISTORTION PERFORMANCE OF VECTOR QUANTIZATION FOR LPC VOICE CODING [J].
JUANG, BH ;
WONG, DY ;
GRAY, AH .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1982, 30 (02) :294-304
[54]   ON USING THE ITAKURA-SAITO MEASURES FOR SPEECH CODER PERFORMANCE EVALUATION [J].
JUANG, BH .
AT&T BELL LABORATORIES TECHNICAL JOURNAL, 1984, 63 (08) :1477-1498
[55]  
KAHN M, 1983, IEEE T ACOUST SPEECH, P531
[56]   PHOTOGLOTTOGRAPHICAL STUDY OF FEMALE VOCAL FOLDS DURING PHONATION [J].
KITZING, P ;
SONESSON, B .
FOLIA PHONIATRICA, 1974, 26 (02) :138-149
[57]   SOFTWARE FOR A CASCADE-PARALLEL FORMANT SYNTHESIZER [J].
KLATT, DH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 67 (03) :971-995
[58]   REVIEW OF TEXT-TO-SPEECH CONVERSION FOR ENGLISH [J].
KLATT, DH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 82 (03) :737-793
[59]   GLOTTAL-AREA TIME FUNCTION AND SUBGLOTTAL-PRESSURE VARIATION [J].
KOIKE, Y ;
HIRANO, M .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 54 (06) :1618-1627
[60]   GLOTTAL SOURCE VOCAL-TRACT INTERACTION [J].
KOIZUMI, T ;
TANIGUCHI, S ;
HIROMITSU, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 78 (05) :1541-1547