The hierarchical hidden Markov model: Analysis and applications

Cited by: 489
Authors
Fine, S [1]
Singer, Y
Tishby, N
Affiliations
[1] Hebrew Univ Jerusalem, Inst Comp Sci, IL-91904 Jerusalem, Israel
[2] AT&T Bell Labs, Florham Pk, NJ 07932 USA
Keywords
statistical models; temporal pattern recognition; hidden variable models; cursive handwriting
DOI
10.1023/A:1007469218079
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We introduce, analyze and demonstrate a recursive hierarchical generalization of the widely used hidden Markov models, which we name Hierarchical Hidden Markov Models (HHMM). Our model is motivated by the complex multi-scale structure which appears in many natural sequences, particularly in language, handwriting and speech. We seek a systematic unsupervised approach to the modeling of such structures. By extending the standard Baum-Welch (forward-backward) algorithm, we derive an efficient procedure for estimating the model parameters from unlabeled data. We then use the trained model for automatic hierarchical parsing of observation sequences. We describe two applications of our model and its parameter estimation procedure. In the first application we show how to construct hierarchical models of natural English text. In these models, different levels of the hierarchy correspond to structures on different length scales in the text. In the second application we demonstrate how HHMMs can be used to automatically identify repeated strokes that represent combinations of letters in cursive handwriting.
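To make the recursive structure described in the abstract concrete, here is a minimal Python sketch of how a hierarchical model can generate an observation sequence. It is an illustration under stated assumptions, not the authors' implementation: the state names, the `END` marker, the alphabet, and all probabilities are hypothetical, and the sketch covers only generation, not the extended Baum-Welch estimation procedure.

```python
import random

# Hypothetical toy model. Internal states own a sub-model over their children
# ("init" and "trans"); production states only emit a symbol ("emit").
# "END" is an assumed marker that returns control to the parent state.
HHMM = {
    "root": {
        "init": {"unit_a": 0.5, "unit_b": 0.5},
        "trans": {
            "unit_a": {"unit_a": 0.2, "unit_b": 0.5, "END": 0.3},
            "unit_b": {"unit_a": 0.5, "unit_b": 0.2, "END": 0.3},
        },
    },
    "unit_a": {
        "init": {"a1": 1.0},
        "trans": {"a1": {"a2": 1.0}, "a2": {"END": 1.0}},
    },
    "unit_b": {
        "init": {"b1": 1.0},
        "trans": {"b1": {"b1": 0.5, "END": 0.5}},
    },
    "a1": {"emit": {"x": 0.9, "y": 0.1}},
    "a2": {"emit": {"y": 0.8, "z": 0.2}},
    "b1": {"emit": {"z": 0.7, "x": 0.3}},
}


def draw(dist, rng):
    """Sample one key from a {outcome: probability} dictionary."""
    keys = list(dist)
    return rng.choices(keys, weights=[dist[k] for k in keys], k=1)[0]


def generate(state, rng, out):
    """Recursively expand `state`, appending emitted symbols to `out`."""
    node = HHMM[state]
    if "emit" in node:
        # Production state: emit a single symbol and return to the parent.
        out.append(draw(node["emit"], rng))
        return
    # Internal state: enter a child, then walk the child-level chain
    # until the END marker hands control back to the parent level.
    child = draw(node["init"], rng)
    while child != "END":
        generate(child, rng, out)
        child = draw(node["trans"][child], rng)


def sample_sequence(rng):
    """Generate one observation sequence starting from the root state."""
    out = []
    generate("root", rng, out)
    return out


if __name__ == "__main__":
    print("".join(sample_sequence(random.Random(0))))
```

In this toy model, each level of the hierarchy produces structure at a different length scale: production states yield single symbols, the two `unit_*` states yield short substrings, and the root strings those substrings together.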
Pages: 41-62
Page count: 22