Component-based discriminative classification for hidden Markov models

被引:20
作者
Bicego, Manuele [1 ,2 ]
Pekalska, Elzbieta [3 ]
Tax, David M. J. [4 ]
Duin, Robert P. W. [4 ]
机构
[1] Univ Verona, Dept Comp Sci, I-37134 Verona, Italy
[2] Univ Sassari, DEIR, I-07100 Sassari, Italy
[3] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
[4] Delft Univ Technol, NL-2628 CD Delft, Netherlands
基金
英国工程与自然科学研究理事会;
关键词
Hidden Markov models; Discriminative classification; Dimensionality reduction; Hybrid models; Generative embeddings;
D O I
10.1016/j.patcog.2009.03.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hidden Markov models (HMMs) have been successfully applied to a wide range of sequence modeling problems. In the classification context, one of the simplest approaches is to train a single HMM per class. A test sequence is then assigned to the class whose HMM yields the maximum a posterior (MAP) probability. This generative scenario works well when the models are correctly estimated. However, the results can become poor when improper models are employed, due to the lack of prior knowledge, poor estimates, violated assumptions or insufficient training data. To improve the results in these cases we propose to combine the descriptive strengths of HMMs with discriminative classifiers. This is achieved by training feature-based classifiers in an HMM-induced vector space defined by specific components of individual hidden Markov models. We introduce four major ways of building Such vector spaces and study which trained combiners are useful in which context. Moreover, we motivate and discuss the merit of our method in comparison to dynamic kernels, in particular, to the Fisher Kernel approach. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2637 / 2648
页数:12
相关论文
共 64 条
[51]   Acoustic modelling using continuous rational kernels [J].
Layton, Martin ;
Gales, Mark .
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2007, 48 (1-2) :67-82
[52]   Text classification using string kernels [J].
Lodhi, H ;
Saunders, C ;
Shawe-Taylor, J ;
Cristianini, N ;
Watkins, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (03) :419-444
[53]   Shallow parsing using specialized HMMs [J].
Molina, A ;
Pla, F .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (04) :595-613
[54]   Cyclic sequence alignments: Approximate versus optimal techniques [J].
Mollineda, RA ;
Vidal, E ;
Casacuberta, F .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2002, 16 (03) :291-299
[55]  
NA K, 1995, EUR C SPEECH COMM TE, P97
[56]  
NOWAK R, 1999, LECT NOTES STAT, V141
[57]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[58]  
Scholkopf B, 2002, Encyclopedia of Biostatistics
[59]  
Smith N, 2002, ADV NEUR IN, V14, P1197
[60]  
Smith N.D., 2003, THESIS CAMBRIDGE U