SEPARATION OF CONCURRENT HARMONIC SOUNDS - FUNDAMENTAL-FREQUENCY ESTIMATION AND A TIME-DOMAIN CANCELLATION MODEL OF AUDITORY PROCESSING

被引:116
作者
DECHEVEIGNE, A [1 ]
机构
[1] ATR,AUDITORY & VISUAL PERCEPT RES LABS,KYOTO 61902,JAPAN
关键词
D O I
10.1121/1.405712
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Signal-processing methods and auditory models for separation of concurrent harmonic sounds are reviewed, and a processing principle is proposed that cancels harmonic interference in the time domain. The principle is first formulated in signal processing terms as a time-domain comb filter. The critical issue of fundamental frequency estimation is investigated and an algorithm is proposed. Tested on a restricted database of natural voiced speech, the algorithm successfully found estimates correct within 3% of an octave for 90% of all frames. Next, the principle is formulated in physiological terms- A hypothetical ''neural comb filter'' is described, based on neural delay lines and inhibitory synapses, and tested using auditory-nerve fiber discharge data obtained in response to concurrent vowels [A. R. Palmer, J. Acoust. Soc. Am. 88, 1412-1426 (1990)]. Processing successfully suppresses the correlates of either vowel in the response of fibers that respond to both, allowing the other vowel to be better represented. The filter belongs to the class of ''cancellation models'' for which predictions can be made concerning the outcome of certain psychoacoustic experiments. These predictions are discussed in relation to recent experimental results obtained elsewhere.
引用
收藏
页码:3271 / 3290
页数:20
相关论文
共 82 条
[1]  
Assmann P., 1988, 7th FASE Symposium. Proceedings Speech '88, P531
[2]   MODELING THE PERCEPTION OF CONCURRENT VOWELS - VOWELS WITH THE SAME FUNDAMENTAL-FREQUENCY [J].
ASSMANN, PF ;
SUMMERFIELD, Q .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 85 (01) :327-338
[3]   MODELING THE PERCEPTION OF CONCURRENT VOWELS - VOWELS WITH DIFFERENT FUNDAMENTAL FREQUENCIES [J].
ASSMANN, PF ;
SUMMERFIELD, Q .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 88 (02) :680-697
[4]  
Bregman A. S., 1990, AUDITORY SCENE ANAL, DOI 10.1121/1.408434
[5]   INTONATION AND THE PERCEPTUAL SEPARATION OF SIMULTANEOUS VOICES [J].
BROKX, JPL ;
NOOTEBOOM, SG .
JOURNAL OF PHONETICS, 1982, 10 (01) :23-36
[6]   TEMPORAL CODING OF RESONANCES BY LOW-FREQUENCY AUDITORY-NERVE FIBERS - SINGLE-FIBER RESPONSES AND A POPULATION-MODEL [J].
CARNEY, LH ;
YIN, TCT .
JOURNAL OF NEUROPHYSIOLOGY, 1988, 60 (05) :1653-1677
[7]   RESPONSES OF LOW-FREQUENCY CELLS IN THE INFERIOR COLLICULUS TO INTERAURAL TIME DIFFERENCES OF CLICKS - EXCITATORY AND INHIBITORY COMPONENTS [J].
CARNEY, LH ;
YIN, TCT .
JOURNAL OF NEUROPHYSIOLOGY, 1989, 62 (01) :144-161
[8]  
CARR CE, 1990, J NEUROSCI, V10, P3227
[9]   EFFECTS OF INTERAURAL TIME DELAYS OF NOISE STIMULI ON LOW-FREQUENCY CELLS IN THE CATS INFERIOR COLLICULUS .2. RESPONSES TO BAND-PASS FILTERED NOISES [J].
CHAN, JCK ;
YIN, TCT ;
MUSICANT, AD .
JOURNAL OF NEUROPHYSIOLOGY, 1987, 58 (03) :543-561