CONSTRAINED ITERATIVE SPEECH ENHANCEMENT WITH APPLICATION TO SPEECH RECOGNITION

Cited by: 136
Authors
HANSEN, JHL [1]
CLEMENTS, MA [1]
Affiliation
[1] GEORGIA INST TECHNOL, SCH ELECT ENGN, ATLANTA, GA 30332
DOI
10.1109/78.80901
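Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract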
In this paper, an improved form of iterative speech enhancement for single-channel inputs is formulated. The basis of the procedure is sequential maximum a posteriori estimation of the speech waveform and its all-pole parameters as originally formulated by Lim and Oppenheim, followed by imposition of constraints upon the sequence of speech spectra. The new approaches impose intraframe and interframe constraints on the input speech signal to ensure more speech-like formant trajectories, reduce frame-to-frame pole jitter, and effectively introduce a relaxation parameter to the iterative scheme. Recently discovered properties of the line spectral pair representation of speech allow for an efficient and direct procedure for application of many of the constraint requirements. Substantial improvement over the unconstrained method has been observed in a variety of domains. First, informal listener quality evaluation tests and objective speech quality measures demonstrate the technique's effectiveness for additive white Gaussian noise. A consistent terminating point for the iterative technique is also shown. Second, the algorithms have been generalized and successfully tested for noise that is nonwhite and slowly varying in characteristics. The current systems result in substantially improved speech quality and LPC parameter estimation in this context with only a minor increase in computational requirements. Third, the algorithms were evaluated with respect to improving automatic recognition of speech in the presence of additive noise, and were shown to outperform other enhancement methods in this application.
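As a rough illustration of the unconstrained baseline the abstract refers to, the following is a minimal sketch of iterative Wiener filtering on a single frame: an all-pole (LPC) model is re-estimated from the current speech estimate, and the noisy frame is then filtered with the resulting Wiener gain. It is not the authors' implementation. The function names, the frame-wise FFT filtering, the use of the prediction-error power as the model gain, and the assumption of white noise with known variance are all simplifications introduced here, and the intraframe and interframe constraints described in the abstract are not implemented.

```python
import numpy as np


def lpc_autocorr(frame, order):
    """All-pole (LPC) fit by the autocorrelation method (hypothetical helper).

    Returns (a, g2) where a[0] == 1, A(z) = sum_k a[k] z^-k, and g2 is the
    per-sample prediction-error power.
    """
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Biased autocorrelation estimate for lags 0..order
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)]) / n
    # Toeplitz normal equations R c = r[1:], with a tiny ridge for numerical safety
    idx = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    R = r[idx] + 1e-9 * r[0] * np.eye(order)
    c = np.linalg.solve(R, r[1:])
    a = np.concatenate(([1.0], -c))
    g2 = max(r[0] - np.dot(c, r[1:]), 1e-12)
    return a, g2


def iterative_wiener(noisy, noise_var, order=10, n_iter=4):
    """Unconstrained iterative Wiener filtering of one frame.

    Assumes additive white noise of known variance noise_var (an assumption
    of this sketch, not a statement about the paper's noise estimator).
    """
    noisy = np.asarray(noisy, dtype=float)
    n = len(noisy)
    Y = np.fft.rfft(noisy)
    est = noisy.copy()
    for _ in range(n_iter):
        # Step 1: estimate all-pole parameters from the current speech estimate
        a, g2 = lpc_autocorr(est, order)
        # Step 2: all-pole speech power spectrum on the FFT grid
        A = np.fft.rfft(a, n)
        p_speech = g2 / (np.abs(A) ** 2 + 1e-12)
        # Step 3: Wiener gain (between 0 and 1), always applied to the noisy frame
        H = p_speech / (p_speech + noise_var)
        est = np.fft.irfft(H * Y, n)
    return est
```

In this sketch the number of iterations `n_iter` is fixed by the caller; the abstract's point about a consistent terminating point and about constraining the spectra between iterations would enter between steps 1 and 2, which is left out here.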
Pages: 795-805
Page count: 11
Related papers
18 in total
  • [1] [Anonymous], 1988, MODERN SPECTRAL ESTI
  • [2] BOLL SF, 1979, T ACOUST SPEECH SIGN, V27, P113
  • [3] CROSMER J, 1985, THESIS GEORGIA I TEC
  • [4] HANSEN JH, 1987, 1987 P IEEE INT C AC, P189
  • [5] HANSEN JHL, 1985, DSPL856 GEORG I TECH
  • [6] HANSEN JHL, 1988, 1988 P IEEE ICASSP N
  • [7] HANSEN JHL, 1985, 110TH P AC SOC AM M
  • [8] HANSEN JHL, 1989, 1989 P IEEE INT C AC, P266
  • [9] ITAKURA F, 1975, J ACOUST SOC AM, V57
  • [10] LIM JS, 1978, T ACOUST SPEECH SIGN, V26, P197