Speech enhancement by residual domain constrained optimization

被引：9

作者：

Jin, Wen ^{[1
]}

Scordilis, Michael S. ^{[1
]}

机构：

[1] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33146 USA

来源：

SPEECH COMMUNICATION | 2006年 / 48卷 / 10期

关键词：

speech enhancement; linear prediction; constrained optimization;

D O I：

10.1016/j.specom.2006.07.001

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A new algorithm for the enhancement of speech corrupted by additive noise is proposed. This algorithm estimates the linear prediction residuals of the clean speech using a constrained optimization criterion. The signal distortion is minimized in the residual domain subject to a constraint on the average power of the noise residuals. Enhanced speech is obtained by exciting the time-varying all-pole synthesis filter with the estimated residuals of the clean speech. The proposed method was tested with speech corrupted by both white Gaussian and colored noise. The enhancement performances were evaluated in terms of segmental signal-to-noise ratio (SNR) and ITU-PESQ scores. Experimental results indicate our method yields better enhancement results than a former residual-weighting scheme [Yegnanarayana, B., Avendano, C., Hermansky, H., Murthy P.S., 1999. Speech enhancement using linear prediction residual. Speech Commun. 28, 25-42]. The proposed method also achieves better noise reduction than the time-domain subspace method [Ephraim, Y., Van Trees, H.L., 1995. A signal subspace approach for speech enhancement. IEEE Trans. Speech Audio Process. 3, 251-266] on real world colored noise. (c) 2006 Elsevier B.V. All rights reserved.

引用

页码：1349 / 1364

页数：16

共 10 条

[1] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].

BOLL, SF .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120

[2] A SIGNAL SUBSPACE APPROACH FOR SPEECH ENHANCEMENT [J].

EPHRAIM, Y ;

VANTREES, HL .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04) :251-266

[3] FILTERING OF COLORED NOISE FOR SPEECH ENHANCEMENT AND CODING [J].

GIBSON, JD ;

KOO, BR ;

GRAY, SD .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (08) :1732-1742

[4] A perceptually motivated approach for speech enhancement [J].

Hu, Y ;

Loizou, PC .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05) :457-465

[5] REDUCTION OF BROAD-BAND NOISE IN SPEECH BY TRUNCATED QSVD [J].

JENSEN, SH ;

HANSEN, PC ;

HANSEN, SD ;

SORENSEN, JA .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (06) :439-448

[6] Extension of the signal subspace speech enhancement approach to colored noise [J].

Lev-Ari, H ;

Ephraim, Y .

IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (04) :104-106

[7] Signal/noise KLT based approach for enhancing speech degraded by colored noise [J].

Mittal, U ;

Phamdo, N .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (02) :159-167

[8]

Quackenbush S., 1988, Objective Measures of Speech Quality

[9] An adaptive KLT approach for speech enhancement [J].

Rezayee, A ;

Gazor, S .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (02) :87-95

[10] Speech enhancement using linear prediction residual [J].

Yegnanarayana, B ;

Avendano, C ;

Hermansky, H ;

Murthy, PS .

SPEECH COMMUNICATION, 1999, 28 (01) :25-42

← 1 →