On the use of orthogonal GMM in speaker recognition

Cited by: 15
Authors
Liu, L [1]
He, JL [1]
Affiliations
[1] Arizona State Univ, Dept Speech & Hearing Sci, Tempe, AZ 85287 USA
Source
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI | 1999
DOI
10.1109/ICASSP.1999.759803
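Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 [Acoustics]; 082403 [Underwater Acoustic Engineering];
Abstract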
Gaussian mixture modeling (GMM) techniques are increasingly used for both speaker identification and verification. Most of these models assume diagonal covariance matrices. Although empirically any distribution can be approximated with a diagonal GMM, a large number of mixture components is usually needed to obtain a good approximation. A consequence of using a large GMM is that training is time-consuming and the response is very slow. This paper proposes a modification to the standard diagonal GMM approach. The proposed scheme includes an orthogonal transformation: feature vectors are first transformed to the space spanned by the eigenvectors of the covariance matrix before being applied to the diagonal GMM. This transformation adds only a small computational load, but results from both speaker identification and verification experiments indicate that it considerably improves recognition performance. For a given performance level, the GMM with the orthogonal transform needs only one-fourth the number of Gaussian functions required by the standard GMM.
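The transformation described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names are hypothetical, the covariance is assumed to be estimated from all of a speaker's training frames, and the mean-centering step is an assumption that the abstract does not state. The sketch projects correlated feature vectors onto the eigenvectors of their covariance matrix, which decorrelates them, and then scores frames with a diagonal-covariance GMM.

```python
import numpy as np


def fit_orthogonal_transform(features):
    """Estimate the transform from an (n_frames, n_dims) feature array.

    Returns the feature mean and a matrix whose columns are the
    orthonormal eigenvectors of the sample covariance matrix.
    """
    mean = features.mean(axis=0)
    cov = np.cov(features, rowvar=False)
    # eigh is intended for symmetric matrices such as a covariance matrix.
    _, eigvecs = np.linalg.eigh(cov)
    return mean, eigvecs


def apply_orthogonal_transform(features, mean, eigvecs):
    """Project features onto the eigenvector basis (centering is an assumption)."""
    return (features - mean) @ eigvecs


def diag_gmm_loglik(x, weights, means, variances):
    """Per-frame log-likelihood under a diagonal-covariance GMM.

    x: (n_frames, n_dims); weights: (k,); means, variances: (k, n_dims).
    """
    diff = x[:, None, :] - means[None, :, :]
    log_comp = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )
    log_weighted = log_comp + np.log(weights)
    # Log-sum-exp over components for numerical stability.
    peak = log_weighted.max(axis=1, keepdims=True)
    return (peak + np.log(np.exp(log_weighted - peak).sum(axis=1, keepdims=True))).ravel()


# Synthetic correlated "features" standing in for real speech frames.
rng = np.random.default_rng(0)
mixing = rng.standard_normal((4, 4))
X = rng.standard_normal((2000, 4)) @ mixing

mean, eigvecs = fit_orthogonal_transform(X)
Y = apply_orthogonal_transform(X, mean, eigvecs)

# After the transform the sample covariance is diagonal up to rounding error.
cov_y = np.cov(Y, rowvar=False)
off_diagonal = cov_y - np.diag(np.diag(cov_y))
```

In this sketch the only extra work at recognition time is one matrix-vector product per frame, which is consistent with the abstract's remark that the transformation adds only a small computational load. Training the diagonal GMM on the transformed features (for example with EM) is left out.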
Pages: 845-848
Page count: 4