Local fuzzy PCA based GMM with dimension reduction on speaker identification

被引:34
作者
Lee, KY [1 ]
机构
[1] Soong Sil Univ, Sch Elect Engn, Seoul 156743, South Korea
关键词
PCA; GMM; fuzzy clustering; speaker identification; dimension reduction;
D O I
10.1016/j.patrec.2004.07.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
To reduce the high dimensionality required for training of feature vectors in speaker identification, we propose an efficient GMM based on local PCA with fuzzy clustering. The proposed method firstly partitions the data space into several disjoint clusters by fuzzy clustering, and then performs PCA using the fuzzy covariance matrix on each cluster. Finally, the GMM for speaker is obtained from the transformed feature vectors with reduced dimension in each cluster. Compared to the conventional GMM with diagonal covariance matrix, the proposed method shows faster result with less storage maintaining same performance. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:1811 / 1817
页数:7
相关论文
共 15 条
[1]
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[2]
[Anonymous], PRINCIPAL COMPONENT
[3]
MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[4]
Semi-tied covariance matrices for hidden Markov models [J].
Gales, MJF .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03) :272-281
[5]
UNSUPERVISED OPTIMAL FUZZY CLUSTERING [J].
GATH, I ;
GEVA, AB .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (07) :773-781
[6]
Gustafson D. E., 1979, Proceedings of the 1978 IEEE Conference on Decision and Control Including the 17th Symposium on Adaptive Processes, P761
[7]
Dimension reduction by local principal component analysis [J].
Kambhatla, N ;
Leen, TK .
NEURAL COMPUTATION, 1997, 9 (07) :1493-1516
[8]
On the use of orthogonal GMM in speaker recognition [J].
Liu, L ;
He, JL .
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, :845-848
[9]
THE IMPORTANCE OF CEPSTRAL PARAMETER CORRELATIONS IN SPEECH RECOGNITION [J].
LJOLJE, A .
COMPUTER SPEECH AND LANGUAGE, 1994, 8 (03) :223-232
[10]
Robust speaker recognition - A feature-based approach [J].
Mammone, RJ ;
Zhang, XY ;
Ramachandran, RP .
IEEE SIGNAL PROCESSING MAGAZINE, 1996, 13 (05) :58-71