TRACK-DRAW - A GRAPHICAL INTERFACE FOR CONTROLLING THE PARAMETERS OF A SPEECH SYNTHESIZER

Cited by: 11
Authors
ASSMANN, P
BALLARD, W
BORNSTEIN, L
PASCHALL, D
Affiliation
[1] School of Human Development, The University of Texas at Dallas, Richardson, 75083-0688, TX
Source
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS | 1994 / Vol. 26 / No. 4
DOI
10.3758/BF03204661
Chinese Library Classification (CLC) number
B841 [Research methods in psychology];
Discipline classification code
040201;
Abstract
In this report we describe a graphical interface for generating voiced speech using a frequency-domain implementation of the Klatt (1980) cascade formant synthesizer. The input to the synthesizer is a set of parameter vectors, called tracks, which specify the overall amplitude, fundamental frequency, formant frequencies, and formant bandwidths at specified time intervals. Tracks are drawn with the aid of a computer mouse that can be used either in point-draw mode, which selects a parameter value for a single time frame, or in line-draw mode, which uses piecewise linear interpolation to connect two user-selected endpoints. Three versions of the program are described: (1) SYNTH draws tracks on an empty time-frequency grid, (2) SPECSYNTH creates a spectrogram of a recorded signal upon which tracks can be superimposed, and (3) SWSYNTH is similar to SPECSYNTH, except that it generates sine-wave speech (Remez, Rubin, Pisoni, & Carrell, 1981) using a set of time-varying sinusoids rather than cascaded formants. The program is written for MATLAB, an interactive computing environment for matrix computation. Track-Draw provides a useful tool for investigating the perceptually salient properties of voiced speech and other sounds.
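The two drawing modes described in the abstract can be illustrated with a minimal sketch. This is not the authors' MATLAB code: the function names `point_draw` and `line_draw`, the 11-frame track, and the values in Hz are all hypothetical, chosen only to show how a single-frame edit differs from piecewise linear interpolation between two selected endpoints.

```python
# Hypothetical sketch of the two track-editing modes; not the Track-Draw source.

def point_draw(track, frame, value):
    """Point-draw mode: set the parameter value for a single time frame."""
    track[frame] = value
    return track


def line_draw(track, frame_a, value_a, frame_b, value_b):
    """Line-draw mode: linearly interpolate between two selected endpoints,
    overwriting every frame from frame_a to frame_b inclusive."""
    # Allow the endpoints to be given in either order.
    if frame_a > frame_b:
        frame_a, value_a, frame_b, value_b = frame_b, value_b, frame_a, value_a
    if frame_a == frame_b:
        track[frame_a] = value_b
        return track
    span = frame_b - frame_a
    for frame in range(frame_a, frame_b + 1):
        fraction = (frame - frame_a) / span
        track[frame] = value_a + fraction * (value_b - value_a)
    return track


# A hypothetical first-formant track: 11 frames, initialized to 500 Hz.
f1_track = [500.0] * 11

point_draw(f1_track, 0, 300.0)            # edit one frame only
line_draw(f1_track, 2, 400.0, 6, 800.0)   # rising segment
line_draw(f1_track, 6, 800.0, 10, 600.0)  # falling segment from the shared endpoint
```

Chaining `line_draw` calls that share an endpoint gives a piecewise linear contour; frames not covered by any edit (frame 1 here) keep their previous value.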
Pages: 431-436
Page count: 6
Related papers
6 items in total
[1] BICKLEY CA, 1992, J ACOUST SOC AM, V91, P2442
[2] JAMIESON DG, 1993, J ACOUST SOC AM, V93, P2394
[3] KLATT, DH. SOFTWARE FOR A CASCADE-PARALLEL FORMANT SYNTHESIZER [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 67 (03): 971-995
[4] KLATT DH, 1982, MIT RES LABORATORY E, P47
[5] Rabiner L. R., 1978, THEORY APPL DIGITAL
[6] REMEZ, RE; RUBIN, PE; PISONI, DB; CARRELL, TD. SPEECH-PERCEPTION WITHOUT TRADITIONAL SPEECH CUES [J]. SCIENCE, 1981, 212 (4497): 947-950