Speech recognition using a wavelet packet adaptive network based fuzzy inference system

被引：75

作者：

Avci, Engin ^{[1
]}

Akpolat, Zuhtu Hakan ^{[1
]}

机构：

[1] Firat Univ, Dept Elect & Comp Sci, Elazig 23119, Turkey

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2006年 / 31卷 / 03期

关键词：

wavelet packet adaptive network based fuzzy inference system; speech recognition; speech/voice signal; feature extraction; wavelet packet decomposition; entropy; expert system;

D O I：

10.1016/j.eswa.2005.09.058

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, an expert speech recognition system is presented. This paper especially deals with the combination of feature extraction and classification for real speech signals. A Wavelet packet adaptive network based fuzzy inference system (WPANFIS) model is developed in this study. WPANFIS consists of two layers: wavelet packet and adaptive network based fuzzy inference system. The wavelet packet layer is used for adaptive feature extraction in the time-frequency domain and is composed of wavelet packet decomposition and wavelet packet entropy. The performance of the developed system is evaluated by using noisy speech signals. Test results showing the effectiveness of the proposed speech recognition system are presented in the paper. The rate of correct classification is about 92% for the sample speech signals. (c) 2005 Elsevier Ltd. All rights reserved.

引用

页码：495 / 503

页数：9

共 33 条

[1] [Anonymous], 1997, A Wavelet Tour of Signal Processing
[2] [Anonymous], P IEEE NORD SIGN PRO
[3] [Anonymous], P ICSLP 96 PHIL US
[4] [Anonymous], P 7 INT C SPOK LANG
[5] Image coding using wavelet transform
Antonini, Marc
Barlaud, Michel
Mathieu, Pierre
Daubechies, Ingrid
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 1992, 1 (02) : 205 - 220
[6] AVCI E, 2005, EXPERT SYSTEMS APPL, V29
[7] Buckheit J., 1995, WAVELETS STAT
[8] BURRUS CS, 1998, INTRO WAVELET WAVELE
[9] THE CHALLENGE OF SPOKEN LANGUAGE SYSTEMS - RESEARCH DIRECTIONS FOR THE NINETIES
COLE, R
HIRSCHMAN, L
ATLAS, L
BECKMAN, M
BIERMANN, A
BUSH, M
CLEMENTS, M
COHEN, J
GARCIA, O
HANSON, B
HERMANSKY, H
LEVINSON, S
MCKEOWN, K
MORGAN, N
NOVICK, DG
OSTENDORF, M
OVIATT, S
PRICE, P
SILVERMAN, H
SPITZ, J
WAIBEL, A
WEINSTEIN, C
ZAHORIAN, S
ZUE, V
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01): : 1 - 21
[10] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES
DAVIS, SB
MERMELSTEIN, P
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04): : 357 - 366

← 1 2 3 4 →