Phoneme recognition using wavelet based features

Cited by: 24
Authors
Farooq, O [1]
Datta, S [1]
Affiliation
[1] Univ Loughborough, Dept Elect & Elect Engn, Loughborough LE11 3TU, Leics, England
Keywords
feature extraction; discrete wavelet transform; multi-layer perceptron
DOI
10.1016/S0020-0255(02)00366-3
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes the use of the discrete wavelet transform (DWT) for the extraction of features from phonemes. Instead of using the short-time Fourier transform for feature extraction, a new set of features is obtained from the DWT. The new set of features overcomes the previously reported problem of shift variance in DWT-based features. Training and test samples of the phonemes were obtained from the TIMIT database. To account for the fast changes in the phonemes, the features were calculated for different phoneme durations and the performance was compared. For the classification of the phonemes, two different classifiers were used, based on linear discriminant analysis and a multi-layer perceptron. © 2002 Elsevier Science Inc. All rights reserved.
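As a rough illustration of what DWT-based feature extraction can look like, the sketch below decomposes one frame with a Haar wavelet and summarizes each sub-band by its log energy. The abstract does not state which wavelet, how many decomposition levels, or which sub-band statistic the authors used, so the Haar filter, the three levels, the log-energy summary, and the function names `haar_dwt_step` and `subband_log_energies` are all assumptions made for this example, not the paper's method.

```python
import math


def haar_dwt_step(signal):
    """One level of the orthonormal Haar DWT: returns (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    starts = range(0, len(signal) - 1, 2)  # non-overlapping sample pairs
    approx = [(signal[i] + signal[i + 1]) * s for i in starts]
    detail = [(signal[i] - signal[i + 1]) * s for i in starts]
    return approx, detail


def subband_log_energies(frame, levels=3):
    """Log mean energy of each detail band, then of the final approximation band.

    Assumption: log energy per sub-band is used as the feature; the paper's
    actual feature definition is not given in the abstract.
    """
    features = []
    approx = list(frame)
    for _ in range(levels):
        approx, detail = haar_dwt_step(approx)
        energy = sum(c * c for c in detail) / len(detail)
        features.append(math.log(energy + 1e-12))  # small offset avoids log(0)
    energy = sum(c * c for c in approx) / len(approx)
    features.append(math.log(energy + 1e-12))
    return features


# Hypothetical 32-sample frame: a low-frequency tone plus a weaker high-frequency tone.
frame = [
    math.sin(2 * math.pi * 2 * n / 32) + 0.5 * math.sin(2 * math.pi * 12 * n / 32)
    for n in range(32)
]
features = subband_log_energies(frame, levels=3)
print(len(features))  # 3 detail bands + 1 approximation band = 4 features
```

A feature vector of this kind would then be passed to a classifier such as linear discriminant analysis or a multi-layer perceptron, the two classifier types named in the abstract.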
Pages: 5-15
Number of pages: 11