ON THE USE OF A METRIC-SPACE SEARCH ALGORITHM (AESA) FOR FAST DTW-BASED RECOGNITION OF ISOLATED WORDS

被引:19
作者
VIDAL, E [1 ]
RULOT, HM [1 ]
CASACUBERTA, F [1 ]
BENEDI, JM [1 ]
机构
[1] UNIV VALENCIA, CTR COMP, VALENCIA, SPAIN
来源
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING | 1988年 / 36卷 / 05期
关键词
MATHEMATICAL PROGRAMMING; DYNAMIC - SIGNAL FILTERING AND PREDICTION - Computer Applications;
D O I
10.1109/29.1575
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The approximating and eliminating search algorithm (AESA) presented was recently introduced for finding nearest neighbors in metric spaces. Although the AESA was originally developed for reducing the time complexity of dynamic time-warping isolated word recognition (DTW-IWR), only rather limited experiments had been previously carried out to check its performance in this task. A set of experiments aimed at filling this gap is reported. The main results show that the important features reflected in previous simulation experiments are exist for real speech samples. With single-speaker dictionaries of up to 200 words, and for most of the different speech parameterizations, local metrics, and DTW productions tested, the AESA consistently found the appropriate prototype while requiring only an average of 7-12 DTW computations (94-96% savings for 200 words), with a strong tendency to need fewer computations if the samples are close to their corresponding prototypes.
引用
收藏
页码:651 / 660
页数:10
相关论文
共 43 条
[1]  
ALVES DS, 1984, 4EME P C RFIA PAR, P511
[2]  
[Anonymous], [No title captured]
[3]  
Aull A. M., 1985, P ICASSP, P1549
[4]  
Bellman R., 1972, DYNAMIC PROGRAMMING
[5]  
Bisiani R., 1982, Proceedings of ICASSP 82. IEEE International Conference on Acoustics, Speech and Signal Processing, P570
[6]  
BUKHARD WA, 1973, COMMUN ASS COMPUT MA, V16, P230
[7]   ON THE METRIC PROPERTIES OF DYNAMIC TIME WARPING [J].
CASACUBERTA, F ;
VIDAL, E ;
RULOT, H .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1987, 35 (11) :1631-1633
[8]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366
[9]  
DELANNOY C, 1980, RAIRO-INF-COMPUT SCI, V14, P275
[10]  
DIVOUX P, 1985, 5EME P AFCET C RFIA