Dynamic sound stream formation based on continuity of spectral change

被引:10
作者
Masuda-Katsuse, I [1 ]
Kawahara, H [1 ]
机构
[1] Inst Syst & Informat Technol, Sawara Ku, Fukuoka 8140001, Japan
关键词
auditory scene analysis; computational model; dynamic stream formation; speech segregation; phonemic restoration; prediction;
D O I
10.1016/S0167-6393(98)00084-3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A proposed computational model that dynamically tracks and predicts changes in spectral shapes was verified in both psychophysical experiments and engineering applications. The results of the psychophysical experiments confirmed the model's validity and suggested that 'the rule of good continuity' also held in audition. Furthermore, a stream segregation system was implemented with the proposed model. It was composed of simultaneous grouping and sequential integration processes. An effective integration of two processes was performed by dynamically controlling the sequential integration based on the reliability of the output of the simultaneous grouping. Finally, we applied this system to phonemic restoration and segregation of two simultaneous utterances, showing the proposed model to be effective for such engineering applications. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:235 / 259
页数:25
相关论文
共 39 条
[1]  
Aikawa K, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P578, DOI 10.1109/ICSLP.1996.607183
[2]  
AIKAWA K, 1995, J ACOUST SOC AM 2, V98, P2926
[3]  
Akagi M., 1986, Transactions of the Institute of Electronics and Communication Engineers of Japan, Part A, VJ69A, P1277
[4]  
Albert S. Bregman, 1990, AUDITORY SCENE ANAL, P411, DOI [DOI 10.1121/1.408434, DOI 10.7551/MITPRESS/1486.001.0001]
[5]   MODELING THE PERCEPTION OF CONCURRENT VOWELS - VOWELS WITH DIFFERENT FUNDAMENTAL FREQUENCIES [J].
ASSMANN, PF ;
SUMMERFIELD, Q .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 88 (02) :680-697
[6]   Modeling the perception of concurrent vowels: Role of formant transitions [J].
Assmann, PF .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (02) :1141-1152
[7]   THE ROLE OF FORMANT TRANSITIONS IN THE PERCEPTION OF CONCURRENT VOWELS [J].
ASSMANN, PF .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 97 (01) :575-584
[8]   PERCEPTION OF SOUNDS CHARACTERIZED BY A RAPIDLY CHANGING RESONANT FREQUENCY [J].
BRADY, PT ;
STEVENS, KN ;
HOUSE, AS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1961, 33 (10) :1357-&
[9]  
Bregman A.S., 1993, THINKING SOUNDS, P10, DOI DOI 10.1093/ACPROF:OSO/9780198522577.001.0001
[10]   COMPUTATIONAL AUDITORY SCENE ANALYSIS [J].
BROWN, GJ ;
COOKE, M .
COMPUTER SPEECH AND LANGUAGE, 1994, 8 (04) :297-336