TRACK-DRAW - A GRAPHICAL INTERFACE FOR CONTROLLING THE PARAMETERS OF A SPEECH SYNTHESIZER

Cited by: 11

Authors:

ASSMANN, P

BALLARD, W

BORNSTEIN, L

PASCHALL, D

Affiliation:

[1] School of Human Development, The University of Texas at Dallas, Richardson, 75083-0688, TX

Source:

BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS | 1994 / Vol. 26 / No. 04

Keywords:

DOI:

10.3758/BF03204661

Chinese Library Classification (CLC) number:

B841 [Research methods in psychology];

Discipline classification code:

040201 ;

Abstract:

In this report we describe a graphical interface for generating voiced speech using a frequency-domain implementation of the Klatt (1980) cascade formant synthesizer. The input to the synthesizer is a set of parameter vectors, called tracks, which specify the overall amplitude, fundamental frequency, formant frequencies, and formant bandwidths at specified time intervals. Tracks are drawn with the aid of a computer mouse that can be used either in point-draw mode, which selects a parameter value for a single time frame, or in line-draw mode, which uses piecewise linear interpolation to connect two user-selected endpoints. Three versions of the program are described: (1) SYNTH draws tracks on an empty time-frequency grid, (2) SPECSYNTH creates a spectrogram of a recorded signal upon which tracks can be superimposed, and (3) SWSYNTH is similar to SPECSYNTH, except that it generates sine-wave speech (Remez, Rubin, Pisoni, & Carrell, 1981) using a set of time-varying sinusoids rather than cascaded formants. The program is written for MATLAB, an interactive computing environment for matrix computation. Track-Draw provides a useful tool for investigating the perceptually salient properties of voiced speech and other sounds.
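The two drawing modes and the sine-wave variant described in the abstract can be illustrated with a minimal sketch. It is written in Python rather than MATLAB and is not the authors' code: the function names `point_draw`, `line_draw`, and `sine_wave_synth`, the 5 ms frame duration, and the 10 kHz sampling rate are all assumptions made for illustration. The sketch holds each sinusoid's frequency and amplitude constant within a frame and accumulates phase across frames so the waveform stays continuous; it does not implement the Klatt cascade formant synthesizer.

```python
import math


def point_draw(track, frame, value):
    """Point-draw mode: set the parameter value for a single time frame."""
    track[frame] = value
    return track


def line_draw(track, frame_a, value_a, frame_b, value_b):
    """Line-draw mode: linearly interpolate between two selected endpoints."""
    if frame_a > frame_b:  # accept the endpoints in either order
        frame_a, value_a, frame_b, value_b = frame_b, value_b, frame_a, value_a
    span = frame_b - frame_a
    for frame in range(frame_a, frame_b + 1):
        frac = 0.0 if span == 0 else (frame - frame_a) / span
        track[frame] = value_a + frac * (value_b - value_a)
    return track


def sine_wave_synth(freq_tracks, amp_tracks, frame_ms=5.0, sample_rate=10000):
    """Sum time-varying sinusoids, one per (frequency, amplitude) track pair.

    frame_ms and sample_rate are assumed values, not taken from the paper.
    """
    samples_per_frame = int(round(sample_rate * frame_ms / 1000.0))
    n_frames = len(freq_tracks[0])
    out = [0.0] * (n_frames * samples_per_frame)
    for freqs, amps in zip(freq_tracks, amp_tracks):
        phase = 0.0  # carried across frames to avoid discontinuities
        i = 0
        for freq, amp in zip(freqs, amps):
            step = 2.0 * math.pi * freq / sample_rate
            for _ in range(samples_per_frame):
                out[i] += amp * math.sin(phase)
                phase += step
                i += 1
    return out


# Usage: draw a rising first-formant track, then render it as one sinusoid.
n_frames = 20
f1 = [500.0] * n_frames
line_draw(f1, 0, 300.0, 10, 700.0)   # rising glide over frames 0..10
point_draw(f1, 15, 650.0)            # adjust a single frame
amp = [0.5] * n_frames
signal = sine_wave_synth([f1], [amp])
```

Piecewise linear tracks are built by calling `line_draw` repeatedly with consecutive endpoint pairs, so each segment ends where the next begins.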


Pages: 431-436

Number of pages: 6

References (6 in total):

[1] BICKLEY CA, 1992, J ACOUST SOC AM, V91, P2442

[2] JAMIESON DG, 1993, J ACOUST SOC AM, V93, P2394

[3] KLATT, DH. SOFTWARE FOR A CASCADE-PARALLEL FORMANT SYNTHESIZER [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 67 (03): 971-995

[4] KLATT DH, 1982, MIT RES LABORATORY E, P47

[5] Rabiner L. R., 1978, THEORY APPL DIGITAL

[6] REMEZ, RE; RUBIN, PE; PISONI, DB; CARRELL, TD. SPEECH-PERCEPTION WITHOUT TRADITIONAL SPEECH CUES [J]. SCIENCE, 1981, 212 (4497): 947-950
