AN APPROACH TO CO-CHANNEL TALKER INTERFERENCE SUPPRESSION USING A SINUSOIDAL MODEL FOR SPEECH

被引:50
作者
QUATIERI, TF
DANISEWICZ, RG
机构
[1] Massachusetts Institute of Technology, Lincoln Laboratory, Lexington
[2] M.I.T. Lincoln Laboratory, Lexington, MA
来源
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING | 1990年 / 38卷 / 01期
关键词
D O I
10.1109/29.45618
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a new approach to co-channel talker interference suppression based on a sinusoidal representation of speech. The technique fits a sinusoidal model to additive vocalic speech segments such that the least mean-squared error between the model and the summed waveforms is obtained. Enhancement is achieved by synthesizing a waveform from the sine waves attributed to the desired speaker. Least-squares estimation is applied to obtain sine-wave amplitudes and phases of both talkers, based on either a priori sine-wave frequencies or a priori fundamental frequency contours. When the frequencies of the two waveforms are closely spaced, the performance is significantly improved by exploiting the time evolution of the sinusoidal parameters across multiple analysis frames. The least-squared error approach is also extended, under restricted conditions, to estimate fundamental frequency contours of both speakers from the summed waveforms. The results obtained, although limited in their scope, provide evidence that the sinusoidal analysis/synthesis model with effective parameter estimation techniques offers a promising approach to the problem of co-channel talker interference suppression over a range of conditions. © 1990 IEEE
引用
收藏
页码:56 / 69
页数:14
相关论文
共 15 条
[1]  
CHILDERS DG, 1987, APR P INT C AC SPEEC, V1, P181
[2]  
DANISEWICZ RG, 1988, TR794 MIT LINC LAB T
[3]  
DANISEWICZ RG, 1987, THESIS MIT
[4]  
HANSON BA, 1983, ADA135702
[5]  
HANSON BA, 1984, APR P INT C AC SPEEC
[6]  
HANSON BA, 1983, APR P INT C AC SPEEC, P1122
[7]   SPEECH ANALYSIS SYNTHESIS BASED ON A SINUSOIDAL REPRESENTATION [J].
MCAULAY, RJ ;
QUATIERI, TF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (04) :744-754
[8]  
NAYLOR J, 1987, APR P INT C AC SPEEC, V1, P205
[9]  
Oppenheim A. V., 1975, DIGITAL SIGNAL PROCE
[10]   SEPARATION OF SPEECH FROM INTERFERING SPEECH BY MEANS OF HARMONIC SELECTION [J].
PARSONS, TW .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 (04) :911-918