SPEAKER-INDEPENDENT RECOGNITION OF ISOLATED WORDS USING CLUSTERING TECHNIQUES

被引:83
作者
RABINER, LR
LEVINSON, SE
ROSENBERG, AE
WILPON, JG
机构
[1] Acoustics Research Department, Bell Laboratories, Murray Hill
来源
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING | 1979年 / 27卷 / 04期
关键词
D O I
10.1109/TASSP.1979.1163259
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A speaker-independent isolated word recognition system is described which is based on the use of multiple templates for each word in the vocabulary. The word templates are obtained from a statistical clustering analysis of a large database consisting of 100 replications of each word (i.e., once by each of 100 talkers). The recognition system, which accepts telephone quality speech input, is based on an LPC analysis of the unknown word, dynamic time warping of each reference template to the unknown word (using the Itakura LPC distance measure), and the application of a K-nearest neighbor (KNN) decision rule. Results for several test sets of data are presented. They show error rates that are comparable to, or better than, those obtained with speaker-trained isolated word recognition systems. Copyright © 1979 by The Institute of Electrical and Electronics Engineers, Inc.
引用
收藏
页码:336 / 349
页数:14
相关论文
共 29 条