ROBUST ESTIMATION OF SPEECH IN NOISY BACKGROUNDS BASED ON ASPECTS OF THE AUDITORY PROCESS

Cited by: 21

Authors:

HANSEN, JHL

NANDKUMAR, S

Affiliation:

[1] Robust Speech Processing Laboratory, Electrical Engineering, Duke University, Durham, North Carolina 27708-0291

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 1995 / Vol. 97 / No. 06

Keywords:

DOI:

10.1121/1.413108

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 [Acoustics]; 082403 [Underwater Acoustic Engineering];

Abstract:

A new approach to speech enhancement is proposed where constraints based on aspects of the auditory process augment an iterative enhancement framework. The basic enhancement framework is based on a previously developed dual-channel scenario using a two-step iterative Wiener filtering algorithm. Constraints across broad speech sections and over iterations are then experimentally developed on a novel auditory representation derived by transforming the speech magnitude spectrum. The spectral transformations are based on modeling aspects of the human auditory process which include critical band filtering, intensity-to-loudness conversion, and lateral inhibition. The auditory transformations and perceptual based constraints are shown to result in a new set of auditory constrained and enhanced linear prediction (ACE-LP) parameters. The ACE-LP based speech spectrum is then incorporated into the iterative Wiener filtering framework. The improvements due to auditory constraints are demonstrated in several areas. The proposed auditory representation is shown to result in improved spectral characterization in background noise. The auditory constrained iterative enhancement (ACE-II) algorithm is shown to result in improved quality over all sections of enhanced speech. Adaptation of auditory based constraints to changing spectral characteristics over broad classes of speech is another novel aspect of the proposed algorithm. The consistency of speech quality improvement for the ACE-II algorithm is illustrated over time and across all phonemes classified over a large set of phonetically balanced sentences from the TIMIT database. This study demonstrates the application of auditory based perceptual properties of a human listener to speech enhancement in noise, resulting in improved and consistent speech quality over all regions of speech. © 1995, Acoustical Society of America. All rights reserved.
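The abstract names three auditory transformations applied to the speech magnitude spectrum: critical band filtering, intensity-to-loudness conversion, and lateral inhibition. The following is only a minimal illustrative sketch of such a chain, not the authors' ACE-LP implementation. Everything specific in it is an assumption not stated in the abstract: the Zwicker-Terhardt Bark approximation, rectangular one-Bark bands, a cube-root loudness exponent, a three-tap inhibition kernel, and all function and parameter names.

```python
import math


def hz_to_bark(f_hz):
    """Zwicker-Terhardt approximation of the Bark scale (assumed here)."""
    return 13.0 * math.atan(0.00076 * f_hz) + 3.5 * math.atan((f_hz / 7500.0) ** 2)


def critical_band_energies(power_spectrum, sample_rate):
    """Sum power-spectrum bins (0 Hz .. Nyquist) into one-Bark-wide bands.

    Rectangular bands are a simplifying assumption; the input needs
    at least two bins.
    """
    n_bins = len(power_spectrum)
    nyquist = sample_rate / 2.0
    n_bands = int(hz_to_bark(nyquist)) + 1
    bands = [0.0] * n_bands
    for k, power in enumerate(power_spectrum):
        f_hz = k * nyquist / (n_bins - 1)
        bands[int(hz_to_bark(f_hz))] += power
    return bands


def intensity_to_loudness(bands, exponent=1.0 / 3.0):
    """Power-law compression; the cube-root exponent is an assumption."""
    return [b ** exponent for b in bands]


def lateral_inhibition(bands, strength=0.25):
    """Subtract a fraction of both neighbours from each band, floored at zero.

    Edge bands reuse their own value for the missing neighbour.
    """
    out = []
    last = len(bands) - 1
    for i, b in enumerate(bands):
        left = bands[i - 1] if i > 0 else b
        right = bands[i + 1] if i < last else b
        out.append(max(b - strength * (left + right), 0.0))
    return out


def auditory_representation(power_spectrum, sample_rate):
    """Chain the three stages in the order the abstract lists them."""
    bands = critical_band_energies(power_spectrum, sample_rate)
    return lateral_inhibition(intensity_to_loudness(bands))
```

In this sketch, a flat band profile is attenuated uniformly by the inhibition stage while an isolated spectral peak is preserved and its neighbours are suppressed, which is the qualitative sharpening effect that lateral inhibition is meant to provide.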

Pages: 3833-3849

Page count: 17

References (39 in total):

[1]

SPEECH ENHANCEMENT BASED CONCEPTUALLY ON AUDITORY EVIDENCE [J].