An adaptive KLT approach for speech enhancement

Cited by: 162

Authors:

Rezayee, A [1]

Gazor, S [1]

Affiliations:

[1] Isfahan Univ Technol, Dept Elect & Comp Engn, Esfahan, Iran

Source:

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001 / Vol. 9 / No. 2

Keywords:

adaptive estimation; adaptive filters; adaptive speech processing; adaptive voice activity detection; music quality enhancement; speech enhancement; speech subspace tracking;

DOI:

10.1109/89.902276

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

An adaptive Karhunen-Loeve transform (KLT) tracking-based algorithm is proposed for enhancement of speech degraded by colored additive interference. This algorithm decomposes noisy speech into its components along the axes of a KLT-based vector space of clean speech. It is observed that the noise energy is distributed unevenly among the eigenvectors. These energies are obtained from noise samples gathered from silence intervals between speech samples. To find these silence intervals, we propose an efficient voice activity detector based on the output of the principal component eigenfilter, that is, the eigenfilter associated with the greatest eigenvalue of the speech KLT. Enhancement is performed by modifying each KLT component according to its noise and clean speech energies. The objective is to minimize the signal distortion while the residual noise power is limited to a specified level. Finally, the inverse KLT is applied and an estimate of the clean signal is synthesized. In our listening tests, 71% of subjects preferred speech enhanced by this method over speech enhanced by earlier methods when the speech was degraded by computer-generated white Gaussian noise. Our method was preferred by 80% of subjects when we processed real samples of noisy speech gathered from various environments.
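
The decompose, modify, and synthesize steps described in the abstract can be illustrated with a minimal batch sketch in Python (NumPy). This is not the authors' adaptive tracking algorithm: it assumes zero-mean frames, estimates the clean-speech covariance by subtracting a noise covariance measured on silence frames, and uses a Wiener-like gain with a hypothetical trade-off parameter `mu`. The function name `klt_enhance` and all argument names are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def klt_enhance(noisy_frames, noise_frames, mu=1.0):
    """Batch sketch of KLT-domain gain weighting (assumed, simplified form).

    noisy_frames: (num_frames, K) array of noisy speech vectors.
    noise_frames: (num_noise_frames, K) array taken from silence intervals.
    mu: hypothetical knob trading signal distortion against residual noise.
    Returns the enhanced frames and the per-component gains.
    """
    noisy_frames = np.asarray(noisy_frames, dtype=float)
    noise_frames = np.asarray(noise_frames, dtype=float)

    # Sample covariances (frames are assumed zero-mean).
    r_noisy = noisy_frames.T @ noisy_frames / len(noisy_frames)
    r_noise = noise_frames.T @ noise_frames / len(noise_frames)

    # Assumption: clean-speech covariance is approximated by subtraction.
    # Its eigenvectors form the KLT basis of (estimated) clean speech.
    eigvals, eigvecs = np.linalg.eigh(r_noisy - r_noise)
    speech_energy = np.maximum(eigvals, 0.0)

    # Noise energy along each eigenvector u_i, i.e. u_i^T R_noise u_i.
    noise_energy = np.einsum("ki,kl,li->i", eigvecs, r_noise, eigvecs)
    noise_energy = np.maximum(noise_energy, 0.0)

    # Wiener-like gain per KLT component (assumed form, values in [0, 1]).
    denom = speech_energy + mu * noise_energy
    gains = np.divide(speech_energy, denom,
                      out=np.zeros_like(denom), where=denom > 0)

    # Forward KLT, per-component weighting, inverse KLT.
    coeffs = noisy_frames @ eigvecs
    enhanced = (coeffs * gains) @ eigvecs.T
    return enhanced, gains
```

Under these assumptions, components whose estimated speech energy is small relative to their noise energy are attenuated most, while components dominated by speech pass through nearly unchanged. A larger `mu` suppresses more residual noise at the cost of more signal distortion.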


Pages: 87-95

Page count: 9
