STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement

被引：175

作者：

Krawczyk, Martin ^{[1
]}

Gerkmann, Timo

机构：

[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Speech Signal Proc Grp, D-26111 Oldenburg, Germany

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2014年 / 22卷 / 12期

关键词：

Noise reduction; phase estimation; signal reconstruction; speech enhancement;

D O I：

10.1109/TASLP.2014.2354236

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The enhancement of speech which is corrupted by noise is commonly performed in the short-time discrete Fourier transform domain. In case only a single microphone signal is available, typically only the spectral amplitude is modified. However, it has recently been shown that an improved spectral phase can as well be utilized for speech enhancement, e. g., for phase-sensitive amplitude estimation. In this paper, we therefore present a method to reconstruct the spectral phase of voiced speech from only the fundamental frequency and the noisy observation. The importance of the spectral phase is highlighted and we elaborate on the reason why noise reduction can be achieved by modifications of the spectral phase. We show that, when the noisy phase is enhanced using the proposed phase reconstruction, instrumental measures predict an increase of speech quality over a range of signal to noise ratios, even without explicit amplitude enhancement.

引用

页码：1931 / 1940

页数：10

共 38 条

[1]

[Anonymous], PERC EV SPEECH QUAL, P862

[2]

[Anonymous], 1988, NAT I STANDARDS THEC

[3]

Brookes Mike., VOICEBOX: Speech Processing Toolbox for MATLAB

[4]

Charpentier F. J., 1986, ICASSP 86 Proceedings. IEEE-IECEJ-ASJ International Conference on Acoustics, Speech and Signal Processing (Cat. No.86CH2243-4), P113

[5] Joint fundamental frequency and order estimation using optimal filtering [J].

Christensen, Mads Graesboll ;

Hojvang, Jesper Lisby ;

Jakobsson, Andreas ;

Jensen, Soren Holdt .

EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,

[6] Speech enhancement using state-based estimation and sinusoidal modeling [J].

Deisher, ME ;

Spanias, AS .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (02) :1141-1148

[7] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445

[8] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121

[9]

Gerkmann T., 2012, P IEEE CONV EL EL EN

[10] Bayesian Estimation of Clean Speech Spectral Coefficients Given a Priori Knowledge of the Phase [J].

Gerkmann, Timo .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (16) :4199-4208

← 1 2 3 4 →