APPLICATION OF CLUSTERING TECHNIQUES TO SPEAKER-TRAINED ISOLATED WORD RECOGNITION

被引:6
作者
RABINER, LR
WILPON, JG
机构
来源
BELL SYSTEM TECHNICAL JOURNAL | 1979年 / 58卷 / 10期
关键词
D O I
10.1002/j.1538-7305.1979.tb02964.x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speaker‐trained, isolated word recognizers have achieved notable success in a wide variety of applications. The training for such systems generally involves a single (or sometimes two) replication(s) of each word of the vocabulary by the designated talker. Word reference templates are then formed directly from these replications. In recent work on speaker‐independent word recognition, it has been shown that statistical clustering procedures provided an effective way for determining the structure in multiple replications of a word by different talkers. Such techniques were then used to provide a set of reference templates based on the clustering results. In this paper we discuss the application of clustering techniques to speaker‐trained word recognizers. It is shown that significant improvements in recognition accuracy are obtained when using templates obtained from a clustering analysis of multiple replications of a word by the designated talker. It is also shown that recognition accuracy did not change with time (over a 6‐month period) for any of the subjects tested, thereby indicating that the reference templates were reasonably stable. © 1979 The Bell System Technical Journal
引用
收藏
页码:2217 / 2233
页数:17
相关论文
共 23 条
[1]  
GOLD B, 1966, MIT452 RES LAB EL TE
[2]   OBJECTIVE AND SUBJECTIVE PERFORMANCE OF TANDEM CONNECTIONS OF WAVEFORM CODERS WITH AN LPC VOCODER [J].
GOODMAN, DJ ;
SCAGLIOLA, C ;
CROCHIERE, RE ;
RABINER, LR ;
GOODMAN, J .
BELL SYSTEM TECHNICAL JOURNAL, 1979, 58 (03) :601-629
[3]  
HERSCHER MB, 1972, 1972 C SPEECH COMM P, P89
[4]  
HYDE SR, 1972, HUMAN COMMUNICATION, P399
[5]   EVALUATION OF VARIOUS PARAMETER SETS IN SPOKEN DIGITS RECOGNITION [J].
ICHIKAWA, A ;
NAKANO, Y ;
NAKATA, K .
IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1973, AU21 (03) :202-209
[6]   MINIMUM PREDICTION RESIDUAL PRINCIPLE APPLIED TO SPEECH RECOGNITION [J].
ITAKURA, F .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1975, AS23 (01) :67-72
[7]   INTERACTIVE CLUSTERING TECHNIQUES FOR SELECTING SPEAKER-INDEPENDENT REFERENCE TEMPLATES FOR ISOLATED WORD RECOGNITION [J].
LEVINSON, SE ;
RABINER, LR ;
ROSENBERG, AE ;
WILPON, JG .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :134-141
[8]  
MAKHOUL JR, 1974, BBN2976 REP
[9]  
MARTIN TB, 1976, P IEEE, V64, P487, DOI 10.1109/PROC.1976.10157
[10]   SPEAKER-INDEPENDENT RECOGNITION OF ISOLATED WORDS USING CLUSTERING TECHNIQUES [J].
RABINER, LR ;
LEVINSON, SE ;
ROSENBERG, AE ;
WILPON, JG .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (04) :336-349