SPEAKER-INDEPENDENT DIGIT RECOGNITION USING A NEURAL NETWORK WITH TIME-DELAYED CONNECTIONS

Cited by: 9
Authors
UNNIKRISHNAN, KP
HOPFIELD, JJ
TANK, DW
Affiliations
[1] AT&T BELL LABS,MOLEC BIOPHYS RES DEPT,MURRAY HILL,NJ 07974
[2] CALTECH,DIV CHEM,PASADENA,CA 91125
[3] CALTECH,DIV BIOL,PASADENA,CA 91125
DOI
10.1162/neco.1992.4.1.108
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The capability of a small neural network to perform speaker-independent recognition of spoken digits in connected speech has been investigated. The network uses time delays to organize rapidly changing outputs of symbol detectors over the time scale of a word. The network is data driven and unclocked. To achieve useful accuracy in a speaker-independent setting, many new ideas and procedures were developed. These include improving the feature detectors, self-recognition of word ends, reduction in network size, and dividing speakers into natural classes. Quantitative experiments based on Texas Instruments (TI) digit data bases are described.
Pages: 108-119
Number of pages: 12
Related papers
9 entries in total
[1]   NETWORK-BASED CONNECTED DIGIT RECOGNITION [J].
BUSH, MA ;
KOPEC, GE .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1987, 35 (10) :1401-1413
[2]   LEARNING ALGORITHMS AND PROBABILITY-DISTRIBUTIONS IN FEEDFORWARD AND FEEDBACK NETWORKS [J].
HOPFIELD, JJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (23) :8429-8433
[3]  
LEONARD GE, 1984, P INT C ACOUSTICS SP, V3
[4]   AN INTRODUCTION TO THE APPLICATION OF THE THEORY OF PROBABILISTIC FUNCTIONS OF A MARKOV PROCESS TO AUTOMATIC SPEECH RECOGNITION [J].
LEVINSON, SE ;
RABINER, LR ;
SONDHI, MM .
BELL SYSTEM TECHNICAL JOURNAL, 1983, 62 (04) :1035-1074
[5]  
RABINER LR, 1988, P ICASSP NEW YORK, P119
[6]  
TANK DW, 1987, 1ST P IEEE INT C NEU
[7]   CONNECTED-DIGIT SPEAKER-DEPENDENT SPEECH RECOGNITION USING A NEURAL NETWORK WITH TIME-DELAYED CONNECTIONS [J].
UNNIKRISHNAN, KP ;
HOPFIELD, JJ ;
TANK, DW .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (03) :698-713
[8]  
UNNIKRISHNAN KP, 1988, NEURAL NETWORKS COMP
[9]   PHONEME RECOGNITION USING TIME-DELAY NEURAL NETWORKS [J].
WAIBEL, A ;
HANAZAWA, T ;
HINTON, G ;
SHIKANO, K ;
LANG, KJ .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (03) :328-339