Speech recognition using a wavelet packet adaptive network based fuzzy inference system

Cited by: 75
Authors
Avci, Engin [1]
Akpolat, Zuhtu Hakan [1]
Affiliation
[1] Firat Univ, Dept Elect & Comp Sci, Elazig 23119, Turkey
Keywords
wavelet packet adaptive network based fuzzy inference system; speech recognition; speech/voice signal; feature extraction; wavelet packet decomposition; entropy; expert system
DOI
10.1016/j.eswa.2005.09.058
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents an expert speech recognition system, focusing on the combination of feature extraction and classification for real speech signals. A wavelet packet adaptive network based fuzzy inference system (WPANFIS) model is developed in this study. WPANFIS consists of two layers: a wavelet packet layer and an adaptive network based fuzzy inference system (ANFIS) layer. The wavelet packet layer performs adaptive feature extraction in the time-frequency domain and is composed of wavelet packet decomposition and wavelet packet entropy. The performance of the developed system is evaluated using noisy speech signals, and test results showing the effectiveness of the proposed speech recognition system are presented. The rate of correct classification is about 92% for the sample speech signals. © 2005 Elsevier Ltd. All rights reserved.
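The feature-extraction layer described in the abstract (wavelet packet decomposition followed by a per-subband entropy) can be illustrated with a minimal sketch. The abstract does not state which wavelet filter, decomposition depth, or entropy definition the authors used, so everything below is an assumption chosen for illustration: a Haar filter, a depth of 3, and Shannon entropy of the normalized coefficient energies. The function names (`haar_step`, `wavelet_packet_leaves`, `shannon_entropy`, `wp_entropy_features`) are hypothetical and do not come from the paper.

```python
import math


def haar_step(signal):
    """Split an even-length signal into Haar approximation and detail halves."""
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal) - 1, 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal) - 1, 2)]
    return approx, detail


def wavelet_packet_leaves(signal, level):
    """Build a full wavelet packet tree and return its 2**level leaf subbands.

    Unlike the plain wavelet transform, every node (approximation AND detail)
    is split again at each level. Leaves are returned in natural tree order,
    not sorted by frequency.
    """
    nodes = [list(signal)]
    for _ in range(level):
        next_nodes = []
        for node in nodes:
            approx, detail = haar_step(node)
            next_nodes.extend([approx, detail])
        nodes = next_nodes
    return nodes


def shannon_entropy(coeffs):
    """Shannon entropy of the normalized coefficient energies (assumed definition)."""
    energy = [c * c for c in coeffs]
    total = sum(energy)
    if total == 0.0:
        return 0.0  # an all-zero subband carries no information
    return -sum((e / total) * math.log(e / total) for e in energy if e > 0.0)


def wp_entropy_features(signal, level=3):
    """Return one entropy value per leaf subband as a feature vector."""
    if len(signal) % (2 ** level) != 0:
        raise ValueError("signal length must be divisible by 2**level")
    return [shannon_entropy(leaf) for leaf in wavelet_packet_leaves(signal, level)]


if __name__ == "__main__":
    # Synthetic test tone standing in for a speech frame (not real data).
    frame = [math.sin(0.3 * n) + 0.5 * math.sin(1.7 * n) for n in range(64)]
    print(wp_entropy_features(frame, level=3))
```

A feature vector of this kind (here 8 values per frame) is what a classifier such as ANFIS would take as input; the classifier itself is not sketched here because the abstract gives no details of its structure or training.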
Pages: 495-503
Number of pages: 9