Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions

Cited by: 45

Authors:

Breithaupt, Colin [1]

Martin, Rainer [1]

Affiliation:

[1] Ruhr Univ Bochum, Inst Commun Acoust, Dept Elect Engn & Informat Sci, D-44780 Bochum, Germany

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011 / Vol. 19 / No. 2

Keywords:

Musical noise; single-channel noise reduction; spectral estimation; speech enhancement; GAMMA;

DOI:

10.1109/TASL.2010.2047681

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Because of their many applications and their relative ease of implementation, single-channel speech enhancement algorithms have received much attention. As a consequence, a vast amount of publications on estimation procedures and their implementation in noise reduction systems exists. However, there has been little systematic research on the theoretic performance of such estimators. In this paper, we provide a systematic analysis of the performance of noise reduction algorithms in low signal-to-noise ratio (SNR) and transient conditions, where we consider approaches using the well-known decision-directed SNR estimator. We show that the smoothing properties of the decision-directed SNR estimator in low SNR conditions can be analytically described and that the limits of noise reduction for widely used spectral speech estimators based on the decision-directed approach can be predicted. We also illustrate that achieving both a good preservation of speech onsets in transient conditions on one side and the suppression of musical noise on the other can be especially problematic when the decision-directed SNR estimation is used.
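For orientation, the decision-directed a priori SNR estimator named in the abstract can be sketched in a few lines. This is a minimal illustration of the estimator in its commonly cited form, not the analysis carried out in the paper. The function name `decision_directed_snr`, the Wiener gain used to obtain the previous clean-speech estimate, the constant known noise power, and the default values of `alpha` and `xi_min` are all assumptions made for this example.

```python
def decision_directed_snr(noisy_power, noise_power, alpha=0.98, xi_min=10 ** (-25 / 10)):
    """Track the a priori SNR of one frequency bin over successive frames.

    noisy_power: sequence of noisy periodogram values |Y|^2, one per frame.
    noise_power: noise power for this bin (assumed known and constant here).
    alpha:       smoothing factor; values close to 1 smooth more strongly (assumed default).
    xi_min:      lower limit on the a priori SNR estimate (assumed default, -25 dB).
    Returns a tuple (xi_estimates, gains), each a list with one value per frame.
    """
    xi_estimates, gains = [], []
    prev_clean_power = 0.0  # estimated clean-speech power of the previous frame
    for y2 in noisy_power:
        gamma = y2 / noise_power  # a posteriori SNR of the current frame
        # Weighted sum of the previous frame's SNR estimate and the
        # instantaneous estimate max(gamma - 1, 0) from the current frame.
        xi = alpha * prev_clean_power / noise_power + (1.0 - alpha) * max(gamma - 1.0, 0.0)
        xi = max(xi, xi_min)
        gain = xi / (1.0 + xi)  # Wiener gain (an assumption; other gain rules are possible)
        prev_clean_power = (gain ** 2) * y2
        xi_estimates.append(xi)
        gains.append(gain)
    return xi_estimates, gains


# Hypothetical input: two noise-only frames followed by a sudden speech onset.
xi, g = decision_directed_snr([1.0, 1.0, 101.0, 101.0, 101.0], noise_power=1.0)
```

With `alpha` close to 1, the estimate in the first onset frame is dominated by the previous (noise-only) frame, so it rises over several frames rather than at once. This is the kind of trade-off between smoothing in low-SNR conditions and tracking speech onsets that the abstract refers to.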


Pages: 277-289

Page count: 13

References (25 in total):

[1] A modular approach to speech enhancement with an application to speech coding [J].

Accardi, AJ ;

Cox, RV .

ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, :201-204

[2] Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors [J].

Andrianakis, I. ;

White, P. R. .

SPEECH COMMUNICATION, 2009, 51 (01) :1-14

[3] [Anonymous], 2007, Speech Enhancement: Theory and Practice

[4] Parameterized MMSE spectral magnitude estimation for the enhancement of noisy speech [J].

Breithaupt, Colin ;

Krawczyk, Martin ;

Martin, Rainer .

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4037-4040

[5] A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing [J].

Breithaupt, Colin ;

Gerkmann, Timo ;

Martin, Rainer .

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4897-4900

[6] Cepstral smoothing of spectral filter gains for speech enhancement without musical noise [J].

Breithaupt, Colin ;

Gerkmann, Timo ;

Martin, Rainer .

IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (12) :1036-1039

[7] Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor [J].

Cappe, Olivier .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) :345-349

[8] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445

[9] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121

[10] Ephraim Y., 2005, The Electrical Engineering Handbook
