A discriminative training algorithm for hidden Markov models

被引：28

作者：

Ben-Yishai, A ^{[1
]}

Burshtein, D ^{[1
]}

机构：

[1] Tel Aviv Univ, Dept Elect Engn Syst, IL-69978 Tel Aviv, Israel

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2004年 / 12卷 / 03期

关键词：

discriminative training; hidden Markov model (HMM); maximum mutual information (MMI) criterion;

D O I：

10.1109/TSA.2003.822639

中图分类号：

O42 [声学];

学科分类号：

070206 [声学]; 082403 [水声工程];

摘要：

We introduce a discriminative training algorithm for the estimation of hidden Markov model (HMM) parameters. This algorithm is based on an approximation of the maximum mutual information (MMI) objective function and its maximization in a technique similar to the expectation-maximization (EM) algorithm. The algorithm is implemented by a simple modification of the standard Baum-Welch algorithm, and can be applied to speech recognition as well as to word-spotting systems. Three tasks were tested: Isolated digit recognition in a noisy environment, connected digit recognition in a noisy environment and word-spotting. In all tasks a significant improvement over maximum likelihood (ML) estimation was observed. We also compared the new algorithm to the commonly used extended Baum-Welch MMI algorithm. In our tests the algorithm showed advantages in terms of both performance and computational complexity.

引用

页码：204 / 217

页数：14

共 19 条

[1]

Bahl L., 1986, INT C ACOUSTICS SPEE, P49

[2]

BAHL LR, 1988, P ICASSP 88 NEW YORK, P493

[3]

A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].

BAUM, LE ;

PETRIE, T ;

SOULES, G ;

WEISS, N .

ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&

[4]

MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].

DEMPSTER, AP ;

LAIRD, NM ;

RUBIN, DB .

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38

[5]

AN INEQUALITY FOR RATIONAL FUNCTIONS WITH APPLICATIONS TO SOME STATISTICAL ESTIMATION PROBLEMS [J].

GOPALAKRISHNAN, PS ;

KANEVSKY, D ;

NADAS, A ;

NAHAMOO, D .

IEEE TRANSACTIONS ON INFORMATION THEORY, 1991, 37 (01) :107-113

[6]

Minimum classification error rate methods for speech recognition [J].

Juang, BH ;

Chou, W ;

Lee, CH .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03) :257-265

[7]

KAPADIA S, 1993, P ICASSP, V2, P491

[8]

Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method [J].

Katagiri, S ;

Juang, BH ;

Lee, CH .

PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2345-2373

[9]

LEONARD RG, 1984, P ICASSP 84

[10]

ON A MODEL-ROBUST TRAINING METHOD FOR SPEECH RECOGNITION [J].

NADAS, A ;

NAHAMOO, D ;

PICHENY, MA .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (09) :1432-1436

← 1 2 →