Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition

被引:46
作者
Chou, W [1 ]
机构
[1] Bell Labs, Lucent Technol, Multimedia Commun Lab, Murray Hill, NJ 07974 USA
关键词
combined string model; discriminant function; hidden Markov model; minimum recognition error rate; speech recognition;
D O I
10.1109/5.880080
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper a discriminant-function-based minimum recognition error rate pattern-recognition approach is described and studied for various applications in speech processing. This approach departs from the conventional paradigm, which links a classification/recognition task to the problem of distribution estimation. Instead, it rakes a discriminant-function-based statistical pattern recognition approach. The suitability of this approach for classification error rate minimization is established through a special loss function. It is meaningful even when the model correct ness assumption is known to be not valid. We study the theoretical basis of this approach and compare it with various criteria used in speech recognition. We differentiate the method of classifier design by way of distribution estimation and the discriminant function methods of minimizing classification error rate based on the fact that in many realistic applications, such as speech recognition, the true distribution form of the soul ce is rarely known precisely, and without model correctness assumption, the classical optimality theory of the distribution estimation approach cannot be applied directly We discuss issues in this new classifier design paradigm and present various extensions of this approach to classifier design applications in speech processing.
引用
收藏
页码:1201 / 1223
页数:23
相关论文
共 112 条
  • [81] PALIWAL KK, P EUROSPEECH 95, P541
  • [82] PAPINENI KA, P ICASSP 99
  • [83] Pollard D., 2012, Convergence of Stochastic Processes
  • [84] Rabiner L., 1993, Fundamentals of Speech Recognition
  • [85] A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION
    RABINER, LR
    [J]. PROCEEDINGS OF THE IEEE, 1989, 77 (02) : 257 - 286
  • [86] Rahim MG, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P1824, DOI 10.1109/ICSLP.1996.607985
  • [87] RAHIM MG, 1997, IEEE T SPEECH AUDIO, V5
  • [88] RAHIM MG, 1996, P ICASSP 96, P3485
  • [89] RAHIM MG, 1997, COMPUT SPEECH LANG
  • [90] RATHINAVELU C, 1995, P IEEE ICASSP 95, P373