STATISTICAL-MODEL-BASED SPEECH ENHANCEMENT SYSTEMS

被引:199
作者
EPHRAIM, Y [1 ]
机构
[1] GEORGE MASON UNIV, C3I RES CTR, FAIRFAX, VA 22030 USA
关键词
D O I
10.1109/5.168664
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech enhancement has been a challenge for many researchers for almost three decades. The problem involves improving the performance of speech communication systems in noisy environments. Since the statistics of the speech signal as well as of the noise are not explicitly available, and the most perceptually meaningful distortion measure is not known, model-based approaches have recently been extensively studied and applied to the three basic problems of speech enhancement. These problems comprise 1) signal estimation from a given sample function of noisy speech, 2) signal coding when only noisy speech is available, and 3) recognition of noisy speech signals in man-machine communication. In this paper, the recent research on the model-based approach is integrated and put into perspective with other more traditional approaches for speech enhancement. A unified statistical approach for the three basic problems of speech enhancement is developed using composite source models for the signal and noise and a fairly large set of distortion measures.
引用
收藏
页码:1526 / 1555
页数:30
相关论文
共 177 条
[51]  
Gray R. M., 1984, IEEE ASSP Magazine, V1, P4, DOI 10.1109/MASSP.1984.1162229
[52]   DISTORTION MEASURES FOR SPEECH PROCESSING [J].
GRAY, RM ;
BUZO, A ;
GRAY, AH ;
MATSUYAMA, Y .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :367-376
[53]   RATE-DISTORTION SPEECH CODING WITH A MINIMUM DISCRIMINATION INFORMATION DISTORTION MEASURE [J].
GRAY, RM ;
GRAY, AH ;
REBOLLEDO, G ;
SHORE, JE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1981, 27 (06) :708-721
[54]   MULTIPLE LOCAL OPTIMA IN VECTOR QUANTIZERS [J].
GRAY, RM ;
KARNIN, ED .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1982, 28 (02) :256-261
[55]  
GRAY RM, 1977, 65041 STANF EL LAB T
[56]  
GRAY RM, 1980, INFORMATION CONT MAY, P178
[57]  
GRAY RM, 1988, IEEE T ACOUST SPEECH, V34, P1033
[58]  
GRENANDER U, 1984, TOEPLITZ FORMS THEIR
[59]   CONSTRAINED ITERATIVE SPEECH ENHANCEMENT WITH APPLICATION TO SPEECH RECOGNITION [J].
HANSEN, JHL ;
CLEMENTS, MA .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) :795-805
[60]  
HANSEN JHL, 1987, APR P IEEE INT C AC, P189