Modelling the perceptual segregation of double vowels with a network of neural oscillators

被引:15
作者
Brown, GJ
Wang, D
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
[2] Ohio State Univ, Columbus, OH 43210 USA
基金
英国工程与自然科学研究理事会; 美国国家科学基金会;
关键词
auditory model; auditory scene analysis; neural network; neural oscillator; perceptual grouping; vowel perception; correlogram;
D O I
10.1016/S0893-6080(97)00046-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability of listeners to identify two simultaneously presented vowels can be improved by introducing a difference in fundamental frequency (FO) between the vowels. We propose an explanation for this phenomenon in the form of a computational model of concurrent sound segregation, which is motivated by neurophysiological evidence of oscillatory firing activity in the auditory cortex and thalamus. More specifically, the model represents the perceptual grouping of auditory frequency channels as synchronised (phase-locked zero phase lag) oscillations in a neural network Computer simulations on a vowel set used in psychophysical studies confirm that the model qualitatively matches the performance of human listeners; vowel identification performance increases with increasing difference in FO. Additionally, the model is able to replicate other findings relating to the perception of harmonic complexes in which one component is mistuned. (C) 1997 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1547 / 1558
页数:12
相关论文
共 59 条
[1]  
Albert S. Bregman, 1990, AUDITORY SCENE ANAL, P411, DOI [DOI 10.7551/MITPRESS/1486.001.0001, 10.1121/1.408434, DOI 10.1121/1.408434]
[2]  
[Anonymous], P ICASSP
[3]   MODELING THE PERCEPTION OF CONCURRENT VOWELS - VOWELS WITH DIFFERENT FUNDAMENTAL FREQUENCIES [J].
ASSMANN, PF ;
SUMMERFIELD, Q .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 88 (02) :680-697
[4]  
BAIRD B, 1996, 17396 CPAM U CAL DEP
[5]   Thalamic modulation of high-frequency oscillating potentials in auditory cortex [J].
Barth, DS ;
MacDonald, KD .
NATURE, 1996, 383 (6595) :78-81
[6]   PITCH IDENTIFICATION OF SIMULTANEOUS DIOTIC AND DICHOTIC 2-TONE COMPLEXES [J].
BEERENDS, JG ;
HOUTSMA, AJM .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 85 (02) :813-819
[7]  
BREGMAN AS, 1992, ADV BIOSCI, V83, P417
[8]   COMPUTATIONAL AUDITORY SCENE ANALYSIS [J].
BROWN, GJ ;
COOKE, M .
COMPUTER SPEECH AND LANGUAGE, 1994, 8 (04) :297-336
[9]  
BROWN GJ, 1997, IN PRESS READING COM
[10]  
BROWN GJ, 1992, P I ACOUSTICS, V14, P439