Component-based discriminative classification for hidden Markov models

Cited by: 20
Authors
Bicego, Manuele [1, 2]
Pekalska, Elzbieta [3]
Tax, David M. J. [4]
Duin, Robert P. W. [4]
Affiliations
[1] Univ Verona, Dept Comp Sci, I-37134 Verona, Italy
[2] Univ Sassari, DEIR, I-07100 Sassari, Italy
[3] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
[4] Delft Univ Technol, NL-2628 CD Delft, Netherlands
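Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Hidden Markov models; Discriminative classification; Dimensionality reduction; Hybrid models; Generative embeddings
DOI
10.1016/j.patcog.2009.03.023
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract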
Hidden Markov models (HMMs) have been successfully applied to a wide range of sequence modeling problems. In the classification context, one of the simplest approaches is to train a single HMM per class. A test sequence is then assigned to the class whose HMM yields the maximum a posteriori (MAP) probability. This generative scenario works well when the models are correctly estimated. However, the results can become poor when improper models are employed, due to the lack of prior knowledge, poor estimates, violated assumptions or insufficient training data. To improve the results in these cases we propose to combine the descriptive strengths of HMMs with discriminative classifiers. This is achieved by training feature-based classifiers in an HMM-induced vector space defined by specific components of individual hidden Markov models. We introduce four major ways of building such vector spaces and study which trained combiners are useful in which context. Moreover, we motivate and discuss the merit of our method in comparison to dynamic kernels, in particular, to the Fisher Kernel approach. (C) 2009 Elsevier Ltd. All rights reserved.
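The two ideas contrasted in the abstract can be illustrated with a minimal sketch: MAP classification with one HMM per class, and a mapping of a sequence into an HMM-induced vector space in which a feature-based classifier could then be trained. The abstract does not specify the four vector-space constructions, so the embedding below (one length-normalized log-likelihood per class model) is an assumed, simplest-case stand-in rather than the authors' component-based construction. The function names, the discrete-observation setting, and the toy parameters are likewise hypothetical.

```python
import numpy as np


def log_likelihood(obs, pi, A, B):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM.

    pi: initial state probabilities, shape (N,)
    A:  transition matrix, A[i, j] = P(state j | state i), shape (N, N)
    B:  emission matrix, B[i, k] = P(symbol k | state i), shape (N, K)
    """
    alpha = pi * B[:, obs[0]]
    scale = alpha.sum()
    log_p = np.log(scale)
    alpha = alpha / scale
    for symbol in obs[1:]:
        alpha = (alpha @ A) * B[:, symbol]
        scale = alpha.sum()
        log_p += np.log(scale)  # accumulate log of the scaling factors
        alpha = alpha / scale   # rescale to avoid numerical underflow
    return log_p


def map_classify(obs, models, log_priors):
    """Generative rule: pick the class maximizing log P(obs | class) + log P(class)."""
    scores = [log_likelihood(obs, *m) + lp for m, lp in zip(models, log_priors)]
    return int(np.argmax(scores))


def embed(obs, models):
    """Assumed embedding: one length-normalized log-likelihood per class model."""
    return np.array([log_likelihood(obs, *m) / len(obs) for m in models])


# Toy two-class setup (hypothetical parameters, not taken from the paper).
A = np.array([[0.9, 0.1], [0.1, 0.9]])
pi = np.array([0.5, 0.5])
models = [
    (pi, A, np.array([[0.9, 0.1], [0.8, 0.2]])),  # class 0 favors symbol 0
    (pi, A, np.array([[0.1, 0.9], [0.2, 0.8]])),  # class 1 favors symbol 1
]
log_priors = np.log([0.5, 0.5])

sequence = [0, 0, 1, 0, 0]
predicted_class = map_classify(sequence, models, log_priors)
feature_vector = embed(sequence, models)  # input for any vector-space classifier
```

In a hybrid setup of this kind, the vectors returned by `embed` for the training sequences would be passed to a standard discriminative classifier in place of the arg-max rule in `map_classify`.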
Pages: 2637-2648
Number of pages: 12