Robust speech recognition method based on discriminative environment feature extraction

Cited by: 11

Authors:

Han, JQ [1]

Gao, W

Affiliations:

[1] Harbin Inst Technol, Dept Comp Engn & Sci, Harbin 150001, Peoples R China

[2] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China

Source:

JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY | 2001 / Vol. 16 / No. 05

Funding:

National Natural Science Foundation of China;

Keywords:

robust speech recognition; minimum classification error; environmental parameter; discriminative learning;

DOI:

10.1007/BF02948964

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 [Computer Science and Technology];

Abstract:

Learning the influence of environmental parameters, such as additive noise and channel distortion, from training data is an effective approach to robust speech recognition. Most previous methods are based on the maximum likelihood estimation criterion. However, these methods do not lead to a minimum error rate. This paper proposes a novel discriminative method for learning environmental parameters, based on the Minimum Classification Error (MCE) criterion. The method uses a simple classifier and the Generalized Probabilistic Descent (GPD) algorithm to learn the environmental parameters iteratively. The clean speech features are then estimated from the noisy speech features using the learned environmental parameters, and these estimates are passed to the back-end HMM classifier. In experiments on a task of 18 isolated, confusable Korean words, the best error rate reduction obtained relative to a conventional HMM system is 32.1%.
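The learning loop summarized in the abstract (a simple classifier, an MCE loss, and GPD updates of the environmental parameters) can be illustrated with a toy sketch. Everything specific here is an assumption made for illustration, not the paper's formulation: the environment is modeled as a single additive bias on the feature vector (`y = x + bias`), the simple classifier is a nearest-prototype rule, and the names `compensate`, `gpd_step`, and `demo`, along with the prototypes, the bias, and the step sizes, are hypothetical. The record does not specify the paper's actual environment model, features, or classifier.

```python
import math
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

def compensate(y, bias):
    """Estimate the clean feature under the ASSUMED model y = x + bias."""
    return [yi - bi for yi, bi in zip(y, bias)]

def classify(x, prototypes):
    """Simple classifier: index of the nearest class prototype."""
    return min(range(len(prototypes)), key=lambda k: sq_dist(x, prototypes[k]))

def gpd_step(y, label, bias, prototypes, gamma=0.1, step=0.5):
    """One GPD update of the bias from a single labeled noisy frame."""
    x_hat = compensate(y, bias)
    g = [-sq_dist(x_hat, mu) for mu in prototypes]   # class discriminants
    rival = max((k for k in range(len(g)) if k != label), key=lambda k: g[k])
    d = g[rival] - g[label]                # misclassification measure, > 0 is an error
    z = max(-50.0, min(50.0, gamma * d))   # clamp so exp() cannot overflow
    loss = 1.0 / (1.0 + math.exp(-z))      # sigmoid: smoothed 0/1 error
    # d(loss)/d(bias) = gamma * loss * (1 - loss) * 2 * (mu_label - mu_rival)
    scale = 2.0 * gamma * loss * (1.0 - loss)
    grad = [scale * (mc - mr)
            for mc, mr in zip(prototypes[label], prototypes[rival])]
    return [bi - step * gi for bi, gi in zip(bias, grad)], loss

def accuracy(data, bias, prototypes):
    """Fraction of frames classified correctly after compensation."""
    hits = sum(classify(compensate(y, bias), prototypes) == c for y, c in data)
    return hits / len(data)

def demo(seed=0, epochs=5):
    """Train the bias on synthetic data; return (acc_before, acc_after, bias)."""
    rng = random.Random(seed)
    prototypes = [[0.0, 0.0], [4.0, 0.0]]  # toy class means (hypothetical)
    true_bias = [3.0, 0.0]                 # toy environment shift (hypothetical)
    data = []
    for c, mu in enumerate(prototypes):
        for _ in range(100):
            y = [m + t + rng.gauss(0.0, 0.3) for m, t in zip(mu, true_bias)]
            data.append((y, c))
    bias = [0.0, 0.0]                      # start with no compensation
    before = accuracy(data, bias, prototypes)
    for _ in range(epochs):
        rng.shuffle(data)
        for y, c in data:
            bias, _loss = gpd_step(y, c, bias, prototypes)
    return before, accuracy(data, bias, prototypes), bias

if __name__ == "__main__":
    acc_before, acc_after, learned_bias = demo()
    print(acc_before, acc_after, learned_bias)
```

In this sketch the bias is updated only along directions that separate the correct class from its best rival, which reflects the discriminative nature of the criterion: the parameters are tuned to reduce classification errors rather than to maximize the likelihood of the noisy data.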

References

Pages: 458-464

Page count: 7

12 entries in total

[1]

ACERO A, 1990, P IEEE INT C AC SPEE, P849

[2]

BIEM A, 1993, P IEEE 1993 INT C AC, pII275

[3]

BIEM A, 1994, P IEEE 1994 INT C AC, pI485

[4]

CEPSTRAL ANALYSIS TECHNIQUE FOR AUTOMATIC SPEAKER VERIFICATION [J].

FURUI, S .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (02) :254-272

[5]

ROBUST SPEECH RECOGNITION IN ADDITIVE AND CONVOLUTIONAL NOISE USING PARALLEL MODEL COMBINATION [J].

GALES, MJF ;

YOUNG, SJ .

COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04) :289-307

[6]

HAN J, 1997, P EUR C SPEECH COMM, P1531

[7]

Han JQ, 1998, INT CONF ACOUST SPEE, P81, DOI 10.1109/ICASSP.1998.674372

[8]

Minimum classification error rate methods for speech recognition [J].

Juang, BH ;

Chou, W ;

Lee, CH .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03) :257-265

[9]

DISCRIMINATIVE LEARNING FOR MINIMUM ERROR CLASSIFICATION [J].

JUANG, BH ;

KATAGIRI, S .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (12) :3043-3054

[10]

ALGORITHM FOR VECTOR QUANTIZER DESIGN [J].

LINDE, Y ;

BUZO, A ;

GRAY, RM .

IEEE TRANSACTIONS ON COMMUNICATIONS, 1980, 28 (01) :84-95
