A genetic classification method for speaker recognition

被引:19
作者
Hong, QY [1 ]
Kwong, S [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
关键词
Gaussian mixture model; speaker identification; genetic algorithm;
D O I
10.1016/j.engappai.2004.08.035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gaussian mixture model (GMM) has been widely used for modeling speakers. In speaker identification, one major problem is how to generate a set of GMMs for identification purposes based upon the training data. Due to the hill-climbing characteristic of the maximum likelihood (ML) method, any arbitrary estimate of the initial model parameters will usually lead to a sub-optimal model in practice. To resolve this problem, this paper proposes a hybrid training method based on genetic algorithm (GA). It utilizes the global searching capability of GA and combines the effectiveness of the ML method. Experimental results based on TI46 and TIMIT showed that this hybrid GA could obtain more optimized GMMs and better results than the simple GA and the traditional ML method. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:13 / 19
页数:7
相关论文
共 19 条
[1]  
CHAU CW, 1997, P ICASSP, V3, P1727
[2]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[3]   Speaker Recognition Using Neural Networks and Conventional Classifiers [J].
Farrell, Kevin R. ;
Mammone, Richard J. ;
Assaleh, Khaled T. .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01) :194-205
[4]  
GAROFOLO JS, 1986, TIMIT ACOUSTIC PHONE
[5]  
HATTORI H, 1992, P IEEE INT C AC SPEE, V2, P153
[6]   An improved maximum model distance approach for HMM-based speech recognition systems [J].
He, QH ;
Kwong, S ;
Man, KF ;
Tang, KS .
PATTERN RECOGNITION, 2000, 33 (10) :1749-1758
[7]   Optimisation of HMM topology and its model parameters by genetic algorithms [J].
Kwong, S ;
Chau, CW ;
Man, KF ;
Tang, KS .
PATTERN RECOGNITION, 2001, 34 (02) :509-522
[8]  
LIBERMAN M, 1980, T146 WORD
[9]  
MAN KF, 1979, GENETIC ALGORITHMS C
[10]  
MATSUI T, 1992, P ICASSP MARCH