VOICE CONVERSION ALGORITHM-BASED ON PIECEWISE-LINEAR CONVERSION RULES OF FORMANT FREQUENCY AND SPECTRUM TILT

被引:61
作者
MIZUNO, H [1 ]
ABE, M [1 ]
机构
[1] NIPPON TELEGRAPH & TEL PUBL CORP, HUMAN INTERFACE LABS, YOKOSUKA, KANAGAWA 23803, JAPAN
关键词
VOICE CONVERSION; FORMANT FREQUENCY; SPECTRAL INTENSITY; SPECTRUM TILT; PIECEWISE LINEAR; LISTENING TEST;
D O I
10.1016/0167-6393(94)00052-C
中图分类号
O42 [声学];
学科分类号
070206 [声学]; 082403 [水声工程];
摘要
This article presents a new algorithm used in order to convert the speech of one speaker so that it sounds like that of another speaker. This algorithm flexibly converts voice quality using two major technical developments. Firstly, the modification of formant frequencies and spectral intensity using piecewise linear voice conversion rules. This enables the control of spectrum parameters in detail. The conversion rules are generated automatically for any pair of speakers. The reliability of the conversion rules is guaranteed because they are statistically generated using training data. Secondly, this algorithm provides the ability to produce speech with the desired formant structure by controlling formant frequencies, formant bandwidths and spectral intensity. Speech is iteratively modified in order to achieve the specified formant structure. Listening tests prove that the proposed algorithm converts speaker individuality while maintaining high speech quality.
引用
收藏
页码:153 / 164
页数:12
相关论文
共 18 条
[1]
ABE M, 1988, ICASSP, P565
[2]
VOICE CONVERSION [J].
CHILDERS, DG ;
WU, K ;
HICKS, DM ;
YEGNANARAYANA, B .
SPEECH COMMUNICATION, 1989, 8 (02) :147-158
[3]
Fant G., 1960, ACOUSTIC THEORY SPEE
[4]
FRANAGAN JL, 1972, SPEECH ANALYSIS SYNT
[5]
FURUI S, 1989, DIGITAL SPEECH PROCE, P97
[6]
Hamon C., 1989, ICASSP-89: 1989 International Conference on Acoustics, Speech and Signal Processing (IEEE Cat. No.89CH2673-2), P238, DOI 10.1109/ICASSP.1989.266409
[7]
HAYASHI C, 1985, BEHAVIORMETRIKA
[8]
ITOH K, 1982, T IEICE JAPAN A, V65, P101
[9]
ANALYSIS, SYNTHESIS, AND PERCEPTION OF VOICE QUALITY VARIATIONS AMONG FEMALE AND MALE TALKERS [J].
KLATT, DH ;
KLATT, LC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (02) :820-857
[10]
KLATT DH, 1982, INT C ACOUST SPEECH, P1589