Speech enhancement using a constrained iterative sinusoidal model

Cited by: 72
Authors
Jensen, J [1]
Hansen, JHL [1]
Affiliations
[1] Aalborg Univ, Ctr PersonKommunikat, Aalborg, Denmark
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001 / Vol. 9 / No. 07
Keywords
sinusoidal speech model; speech and noise; speech enhancement; speech quality;
DOI
10.1109/89.952491
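Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 [Acoustics]; 082403 [Underwater Acoustic Engineering];
Abstract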
This paper presents a sinusoidal-model-based algorithm for enhancement of speech degraded by additive broad-band noise. In order to ensure speech-like characteristics observed in clean speech, smoothness constraints are imposed on the model parameters using a spectral envelope surface (SES) smoothing procedure. Algorithm evaluation is performed using speech signals degraded by additive white Gaussian noise. Distortion as measured by objective speech quality scores showed a 34%-41% reduction over an SNR range of 5 to 20 dB. Objective and subjective evaluations also show considerable improvement over traditional spectral subtraction and Wiener-filtering-based schemes. Finally, in a subjective AB preference test, where enhanced signals were coded with the G.729 codec, the proposed scheme was preferred over the traditional enhancement schemes tested for SNRs in the range of 5 to 20 dB.
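The evaluation condition named in the abstract (speech degraded by additive white Gaussian noise at SNRs from 5 to 20 dB) can be illustrated with a minimal sketch. This is not the authors' code: the function names `add_white_noise` and `measured_snr_db`, the 8 kHz sampling rate, and the two-sinusoid stand-in for a speech signal are all assumptions made for illustration only.

```python
import numpy as np


def add_white_noise(clean, snr_db, rng=None):
    """Return `clean` plus white Gaussian noise scaled to the requested SNR in dB.

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    clean = np.asarray(clean, dtype=float)
    signal_power = np.mean(clean ** 2)
    noise = rng.standard_normal(clean.shape)
    # Choose the noise power so that 10*log10(signal_power / noise_power) == snr_db.
    target_noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise *= np.sqrt(target_noise_power / np.mean(noise ** 2))
    return clean + noise


def measured_snr_db(clean, noisy):
    """Global SNR in dB of `noisy` relative to the known `clean` signal."""
    clean = np.asarray(clean, dtype=float)
    noise = np.asarray(noisy, dtype=float) - clean
    return 10.0 * np.log10(np.mean(clean ** 2) / np.mean(noise ** 2))


# Stand-in "speech": a sum of two sinusoids (assumed 8 kHz sampling rate, 1 second).
fs = 8000
t = np.arange(fs) / fs
clean = 0.6 * np.sin(2 * np.pi * 200 * t) + 0.3 * np.sin(2 * np.pi * 400 * t)

# Sweep the SNR range quoted in the abstract and report the SNR actually obtained.
for snr in (5, 10, 15, 20):
    noisy = add_white_noise(clean, snr, np.random.default_rng(0))
    print(snr, round(measured_snr_db(clean, noisy), 2))
```

The noise is rescaled after it is drawn, so the realized SNR matches the target rather than only matching it in expectation; an enhancement algorithm would then be scored on how far it reduces the distortion in `noisy`.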
Pages: 731-740
Page count: 10