SOME PROBLEMS IN VOICE SOURCE ANALYSIS

被引:55
作者
FANT, G
机构
[1] Department of Speech Communication and Music Acoustics, Royal Institute of Technology, KTH, S-10044 Stockholm
关键词
VOICE PRODUCTION THEORY; INVERSE FILTERING; GLOTTAL FLOW; VOICE SOURCE DYNAMICS; SOURCE SPECTRUM;
D O I
10.1016/0167-6393(93)90055-P
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This is an overview of some recent studies of voice source acoustics and glottal flow analysis and modelling performed at the KTH. Time and frequency domain aspects of the production process are discussed with a view of relating glottal flow parameters from inverse filtering and vocal tract transfer functions to formant amplitudes and bandwidths. Alternative methods of determining the time constant T(a) = 1/(2piF(a)) in the return phase of glottal flow derivative after the instant of excitation, and thus of spectral tilt, are discussed. Selective inverse filtering, removing all but one formant, is potentially useful for this purpose. The influence of uncertainties in quantifying the vocal tract transfer function is exemplified by a calculation of the effects of introducing a finite baffle effect of the human head adding a high-frequency emphasis above the standard + 6 dB/octave. Particular attention has been paid to temporal variations within an utterance as derived from continuous inverse filtering. Aspects of breathy voicing and female-male differences in voice production are discussed. It is demonstrated that the temporal profile of the excitation amplitude, E(e)(t), within an utterance derived from a male speaker can be approximated by the envelope of the negative part of the speech wave.
引用
收藏
页码:7 / 22
页数:16
相关论文
共 51 条
[1]  
ANANTHAPADMANAB.TV, 1982, SPEECH COMMUN, V1, P167
[2]  
Ananthapadmanabha T., 1984, STL QPSR, P1
[3]  
BICKLEY C, 1991, VOCAL FOLD PHYSL ACO, P37
[4]   EFFECTS OF A VOCAL-TRACT CONSTRICTION ON THE GLOTTAL SOURCE - EXPERIMENTAL AND MODELING STUDIES [J].
BICKLEY, CA ;
STEVENS, KN .
JOURNAL OF PHONETICS, 1986, 14 (3-4) :373-382
[5]  
BRIESS B, 1962, STLQPSR, P6
[6]   EXPERIMENTS WITH VOICE MODELING IN SPEECH SYNTHESIS [J].
CARLSON, R ;
GRANSTROM, B ;
KARLSSON, I .
SPEECH COMMUNICATION, 1991, 10 (5-6) :481-489
[7]  
CARLSON R, 1989, MAY P INT C AC SPEEC, V1, P223
[8]   FORMANT-AMPLITUDE MEASUREMENTS [J].
FANT, G ;
LILJENCRANTS, J ;
MARTONY, J ;
FINTOFT, K ;
LINDBLOM, B .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1963, 35 (11) :1753-+
[9]   PROSODIC AND SEGMENTAL SPEAKER VARIATIONS [J].
FANT, G ;
KRUCKENBERG, A ;
NORD, L .
SPEECH COMMUNICATION, 1991, 10 (5-6) :521-531
[10]   GLOTTAL FLOW - MODELS AND INTERACTION [J].
FANT, G .
JOURNAL OF PHONETICS, 1986, 14 (3-4) :393-399