Minimum classification error training for online handwriting recognition

被引:38
作者
Biem, A [1 ]
机构
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
minimum classification error; hidden Markov model; handwriting recognition; maximum likelihood; discriminative training; dynamic programming; finite state machine;
D O I
10.1109/TPAMI.2006.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an application of the Minimum Classification Error (MCE) criterion to the problem of recognizing online unconstrained-style characters and words. We describe an HMM-based, character and word-level MCE training aimed at minimizing the character or word error rate while enabling flexibility in writing style through the use of multiple allographs per character. Experiments on a writer-independent character recognition task covering alpha-numerical characters and keyboard symbols show that the MCE criterion achieves more than 30 percent character error rate reduction compared to the baseline Maximum Likelihood-based system. Word recognition results, on vocabularies of 5k to 10k, show that MCE training achieves around 17 percent word error rate reduction when compared to the baseline Maximum Likelihood system.
引用
收藏
页码:1041 / 1051
页数:11
相关论文
共 49 条
  • [1] ANDQUETIL E, 1996, P INF PROC MAN UNC K, P259
  • [2] [Anonymous], 1988, EMPIRICAL STUDY LEAR
  • [3] Biem A, 2003, 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P868
  • [4] Minimum classification error training for online handwritten word recognition
    Biem, A
    [J]. EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, : 61 - 66
  • [5] Pattern recognition using discriminative feature extraction
    Biem, A
    Katagiri, S
    Juang, BH
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (02) : 500 - 504
  • [6] An application of discriminative feature extraction lo filter-bank-based speech recognition
    Biem, A
    Katagiri, S
    McDermott, E
    Juang, BH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (02): : 96 - 110
  • [7] BIEM A, 1993, P IEEE INT C AC SPEE, V2, P275
  • [8] BIEM A, 1997, THESIS U PARIS 6
  • [9] Biem AE, 2001, INT CONF ACOUST SPEE, P1529, DOI 10.1109/ICASSP.2001.941223
  • [10] BISHIP CM, 1995, NEURAL NETWORK PATTE