AANN: an alternative to GMM for pattern recognition

被引：72

作者：

Yegnanarayana, B ^{[1
]}

Kishore, SP ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Comp Sci & Engn, Madras 600036, Tamil Nadu, India

来源：

NEURAL NETWORKS | 2002年 / 15卷 / 03期

关键词：

autoassociative neural network models; training error surface; annealing gain parameter; speaker verification;

D O I：

10.1016/S0893-6080(02)00019-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The objective in any pattern recognition problem is to capture the characteristics common to each class from feature vectors of the training data. While Gaussian mixture models appear to be general enough to characterize the distribution of the given data, the model is constrained by the fact that the shape of the components of the distribution is assumed to be Gaussian, and the number of mixtures are fixed a priori. In this context, we investigate the potential of non-linear models such as autoassociative neural network (AANN) models, which perform identity mapping of the input space. We show that the training error surface realized by the neural network model in the feature space is useful to study the characteristics of the distribution of the input data. We also propose a method of obtaining an error surface to match the distribution of the given data. The distribution capturing ability of AANN models is illustrated in the context of speaker verification. (C) 2002 Published by Elsevier Science Ltd.

引用

页码：459 / 469

页数：11

共 29 条

[1] EFFECTIVENESS OF LINEAR PREDICTION CHARACTERISTICS OF SPEECH WAVE FOR AUTOMATIC SPEAKER IDENTIFICATION AND VERIFICATION [J].

ATAL, BS .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (06) :1304-1312

[2]

Bishop C. M., 1995, NEURAL NETWORKS PATT

[3]

DeMers D., 1993, Advances in Neural Information Processing Systems, P580

[4] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].

DEMPSTER, AP ;

LAIRD, NM ;

RUBIN, DB .

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38

[5]

Diamantaras KI, 1996, Principal Component Neural Networks: Theory and Applications

[6] Impostor cohort selection for score normalisation in speaker verification [J].

Finan, RA ;

Sapeluk, AT ;

Damper, RI .

PATTERN RECOGNITION LETTERS, 1997, 18 (09) :881-888

[7] CEPSTRAL ANALYSIS TECHNIQUE FOR AUTOMATIC SPEAKER VERIFICATION [J].

FURUI, S .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (02) :254-272

[8] Autoassociator-based models for speaker verification [J].

Gori, M ;

Lastrucci, L ;

Soda, G .

PATTERN RECOGNITION LETTERS, 1996, 17 (03) :241-250

[9]

Gray R. M., 1984, IEEE ASSP Magazine, V1, P4, DOI 10.1109/MASSP.1984.1162229

[10]

Haykin S., 1999, Neural Networks: A Comprehensive Foundation, V2nd ed

← 1 2 3 →