Speech enhancement using state-based estimation and sinusoidal modeling

被引:8
作者
Deisher, ME
Spanias, AS
机构
[1] Department of Electrical Engineering, Telecommunications Research Center, Arizona State University, Tempe
关键词
D O I
10.1121/1.419866
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A procedure for estimating the parameters of a sinusoidal model from speech corrupted by additive noise is described. An approximate harmonic representation is used wherein voiced speech is represented by a set of sine waves at multiples of the fundamental frequency and several additional components at frequencies near each harmonic. Amplitudes and phases of the sinusoidal components are estimated using a state-based technique that employs hidden Markov models (HMMs) to classify speech and noise spectra. Voicing and fundamental frequency are determined using an analysis-by-synthesis approach. Simulation results are presented, comparing the performance of the proposed algorithm to that of the standard HMM-based minimum mean square error (MMSE) estimator. The proposed method was found to reduce the structured residual noise associated with HMM-based algorithms. (C) 1997 Acoustical Society of America.
引用
收藏
页码:1141 / 1148
页数:8
相关论文
共 16 条
[1]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[2]  
DEISHER ME, 1996, THESIS ARIZONA STATE
[3]  
EPHRAIM Y, 1990, P IEEE INT C AC SPEE, V2, P829
[4]   VITERBI ALGORITHM [J].
FORNEY, GD .
PROCEEDINGS OF THE IEEE, 1973, 61 (03) :268-278
[5]  
Garofolo J. S., 1988, Tech. rep.
[6]   ENHANCEMENT AND BANDWIDTH COMPRESSION OF NOISY SPEECH [J].
LIM, JS ;
OPPENHEIM, AV .
PROCEEDINGS OF THE IEEE, 1979, 67 (12) :1586-1604
[7]  
MACAULAY RJ, 1992, ADV SPEECH SIGNAL PR, P165
[8]  
MCAULAY RJ, 1990, P ICASSP, P249
[9]   AN APPROACH TO CO-CHANNEL TALKER INTERFERENCE SUPPRESSION USING A SINUSOIDAL MODEL FOR SPEECH [J].
QUATIERI, TF ;
DANISEWICZ, RG .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (01) :56-69
[10]  
QUATIERI TF, 1990, P IEEE INT C AC SPEE, P821