Modeling the perception of concurrent vowels: Role of formant transitions

Cited by: 16
Author
Assmann, PF
Affiliation
[1] School of Human Development, University of Texas at Dallas, Box 830688, Richardson
Keywords
DOI
10.1121/1.416299
Chinese Library Classification number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
When two synthetic vowels are presented concurrently and monaurally, listeners identify the members of the pair more accurately if they differ in fundamental frequency (F0), or if one of them is preceded or followed by formant transitions that specify a glide or liquid consonant. However, formant transitions do not help listeners identify the vowel to which they are linked; instead, they make the competing vowel easier to identify. One explanation is that the formant transition region provides a brief time interval during which the competing vowel is perceptually more prominent. This interpretation is supported by the predictions of two computational models of the identification of concurrent vowels that (i) perform a frequency analysis using a bank of bandpass filters, (ii) analyze the waveform in each channel using a brief, sliding temporal window, and (iii) determine which region of the signal provides the strongest evidence of each vowel. Model A [Culling and Darwin, J. Acoust. Soc. Am. 95, 1559-1569 (1994)] computes the rms energy in each channel at successive time intervals to generate running excitation patterns that serve as input to a vowel classifier, implemented as a linear associative neural network. Model B uses a temporal analysis in each channel to generate running autocorrelation functions, and it includes a further stage of source segregation [Meddis and Hewitt, J. Acoust. Soc. Am. 91, 233-245 (1992)] to partition the channels into two groups, one group containing evidence of the periodicity of the vowel with the dominant F0, the other group providing evidence of the competing vowel. Both models predicted effects of F0 and formant transitions on identification, but model B provided more accurate predictions of the pattern of listeners' identification responses. Taken together, the empirical and modelling results support the idea that the identification of concurrent vowels involves an analysis of the composite waveform using a sliding temporal window, combined with a form of F0-guided source segregation. © 1996 Acoustical Society of America.
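The processing stages named in the abstract (bandpass filter bank, sliding temporal window, per-channel rms energy, and periodicity-based channel grouping) can be illustrated with a minimal sketch. This is not the published implementation of either model. It assumes simple second-order bandpass filters in place of auditory filters, a single-lag normalized autocorrelation in place of full running autocorrelation functions, and an F0 lag supplied by the caller rather than estimated; the vowel classifier is omitted. All function names, the filter Q, the window lengths, and the 0.7 threshold are hypothetical choices made for illustration.

```python
import math


def bandpass(signal, fs, fc, q=4.0):
    """Second-order bandpass filter (unity gain at fc); a stand-in for an auditory filter."""
    w0 = 2 * math.pi * fc / fs
    alpha = math.sin(w0) / (2 * q)
    b0, b2 = alpha, -alpha
    a0, a1, a2 = 1 + alpha, -2 * math.cos(w0), 1 - alpha
    out = []
    x1 = x2 = y1 = y2 = 0.0
    for x in signal:
        y = (b0 * x + b2 * x2 - a1 * y1 - a2 * y2) / a0
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out


def filterbank(signal, fs, centers):
    """Frequency analysis: one filtered waveform per center frequency."""
    return [bandpass(signal, fs, fc) for fc in centers]


def running_excitation(channels, fs, win=0.010, hop=0.005):
    """rms energy per channel in successive windows (one list of channel values per frame)."""
    n_win, n_hop = round(win * fs), round(hop * fs)
    frames = []
    for start in range(0, len(channels[0]) - n_win + 1, n_hop):
        frames.append([
            math.sqrt(sum(v * v for v in ch[start:start + n_win]) / n_win)
            for ch in channels
        ])
    return frames


def periodicity(channel, start, n_win, lag):
    """Normalized autocorrelation of one channel at a single lag (in samples)."""
    a = channel[start:start + n_win]
    b = channel[start + lag:start + lag + n_win]
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0 else 0.0


def segregate(channels, start, n_win, lag, threshold=0.7):
    """Split channel indices into those periodic at `lag` and the remainder."""
    dominant, other = [], []
    for i, ch in enumerate(channels):
        group = dominant if periodicity(ch, start, n_win, lag) >= threshold else other
        group.append(i)
    return dominant, other


# Usage: two sinusoids standing in for harmonics of two different F0s
# (500 Hz is a harmonic of 100 Hz; 1120 Hz is a harmonic of 140 Hz).
fs = 8000
mix = [math.sin(2 * math.pi * 500 * n / fs) + math.sin(2 * math.pi * 1120 * n / fs)
       for n in range(1600)]
channels = filterbank(mix, fs, [500, 1120])
frames = running_excitation(channels, fs)
groups = segregate(channels, start=800, n_win=160, lag=fs // 100)  # lag of a 100 Hz F0
print(len(frames), groups)
```

In this sketch the running excitation pattern corresponds to the input stage described for model A, and the two channel groups returned by `segregate` correspond loosely to the partition described for model B.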
Pages: 1141-1152
Number of pages: 12
Related papers
39 items in total
[1]  
Albert S. Bregman, 1990, AUDITORY SCENE ANAL, P411, DOI [10.7551/MITPRESS/1486.001.0001, 10.1121/1.408434]
[2]  
[Anonymous], 1993, 35 APPL COMP INC
[3]  
[Anonymous], 1986, EXPLOR MICROSTRUCT C
[4]   TRACK-DRAW - A GRAPHICAL INTERFACE FOR CONTROLLING THE PARAMETERS OF A SPEECH SYNTHESIZER [J].
ASSMANN, P ;
BALLARD, W ;
BORNSTEIN, L ;
PASCHALL, D .
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1994, 26 (04) :431-436
[5]   MODELING THE PERCEPTION OF CONCURRENT VOWELS - VOWELS WITH THE SAME FUNDAMENTAL-FREQUENCY [J].
ASSMANN, PF ;
SUMMERFIELD, Q .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 85 (01) :327-338
[6]   MODELING THE PERCEPTION OF CONCURRENT VOWELS - VOWELS WITH DIFFERENT FUNDAMENTAL FREQUENCIES [J].
ASSMANN, PF ;
SUMMERFIELD, Q .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 88 (02) :680-697
[7]   THE CONTRIBUTION OF WAVE-FORM INTERACTIONS TO THE PERCEPTION OF CONCURRENT VOWELS [J].
ASSMANN, PF ;
SUMMERFIELD, Q .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 95 (01) :471-484
[8]   THE ROLE OF FORMANT TRANSITIONS IN THE PERCEPTION OF CONCURRENT VOWELS [J].
ASSMANN, PF .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 97 (01) :575-584
[9]  
ASSMANN PF, 1994, J ACOUST SOC AM, V95, P2965
[10]  
ASSMANN PF, 1993, 16 MIDW RES M ASS RE, P258