Hybrid HMM-NN modeling of stationary-transitional units for continuous speech recognition

被引:7
作者
Albesano, D [1 ]
Gemello, R [1 ]
Mana, F [1 ]
机构
[1] Ctr Studi & Lab Telecomun SpA, I-10148 Turin, Italy
关键词
D O I
10.1016/S0020-0255(99)00106-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
This paper describes the benefits in recognition accuracy that can be achieved in a hybrid Hidden Markov Model-Neural Network (HMM-NN) recognition framework by using context-dependent subword units named Stationary - Transitional Units. These units are made up of stationary parts of the context-independent phonemes plus all the admissible transitions between them; they have good generalization capability and capture a wide acoustic detail. These units are very suitable to be modeled with neural networks, can enhance the performances of hybrid HMM-NN systems, and represent a real alternative to the context-independent phonemes. The efficacy of Stationary,Transitional Units is verified for the Italian language on isolated and continuous speech recognition tasks extracted from a real application employed for railway timetable telephonic vocal access. The results show that a relevant improvement is achieved with respect to the use of the context-independent phonemes. (C) 2000 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:3 / 11
页数:9
相关论文
共 10 条
[1]
A robust system for human-machine dialogue in telephony-based applications [J].
Albesano D. ;
Baggia P. ;
Danieli M. ;
Gemello R. ;
Gerbino E. ;
Rullent C. .
International Journal of Speech Technology, 1997, 2 (2) :101-111
[2]
ALBESANO D, 1996, P IEEE NNSP WORKSH K
[3]
BOURLAD H, 1993, CONNECTIONIST SPEECH
[4]
FISSORE L, 1995, P EUROSPEECH 95 MADR, P799
[5]
FRANZINI MA, 1990, P INT C AC SPEECH SI, P425
[6]
GEMELLO R, 1995, 950230 CSELT
[7]
HAFFNER P, P ICASSP 91, P105
[8]
HOCHBERG MM, P ICASSP 95 DETR US, P69
[9]
Rabiner L., 1993, Fundamentals of Speech Recognition
[10]
AN APPLICATION OF RECURRENT NETS TO PHONE PROBABILITY ESTIMATION [J].
ROBINSON, AJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :298-305