Optical character recognition for cursive handwriting

被引:102
作者
Arica, N [1 ]
Yarman-Vural, FT [1 ]
机构
[1] Middle E Tech Univ, Dept Comp Engn, TR-06531 Ankara, Turkey
关键词
handwritten word recognition; preprocessing; segmentation; optical character recognition; cursive handwriting; hidden Markov model; search; graph; lexicon matching;
D O I
10.1109/TPAMI.2002.1008386
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a new analytic scheme, which uses a sequence of segmentation and recognition algorithms, is proposed for offline cursive handwriting recognition problem. First, some global parameters, such as slant angle, baselines, and stroke width and height are estimated. Second, a segmentation method finds character segmentation paths by combining gray scale and binary information. Third, Hidden Markov Model (HMM) is employed for shape recognition to label and rank the character candidates. For this purpose, a string of codes is extracted from each segment to represent the character candidates. The estimation of feature space parameters is embedded in HMM training stage together with the estimation of the HMM model parameters. Finally, the lexicon information and HMM ranks are combined in a graph optimization problem for word-level recognition. This method corrects most of the errors produced by segmentation and HMM ranking stages by maximizing an information measure in an efficient graph search algorithm. The experiments in dicate higher recognition rates compared to the available methods reported in the literature.
引用
收藏
页码:801 / 813
页数:13
相关论文
共 23 条
[1]   One-dimensional representation of two-dimensional information for HMM based handwriting recognition [J].
Arica, N ;
Yarman-Vural, FT .
PATTERN RECOGNITION LETTERS, 2000, 21 (6-7) :583-592
[2]  
Arica N, 1998, INT C PATT RECOG, P1127, DOI 10.1109/ICPR.1998.711893
[3]   A heuristic algorithm for optical character recognition of Arabic script [J].
Atici, AA ;
YarmanVural, FT .
SIGNAL PROCESSING, 1997, 62 (01) :87-99
[4]  
Caesar T., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P382, DOI 10.1109/ICDAR.1995.599018
[5]  
Casey R. G., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P1028, DOI 10.1109/ICDAR.1995.602078
[6]  
Dengel A., 1997, HDB CHARACTER RECOGN, P227
[7]   Automated forms-processing software and services [J].
Gopisetty, S ;
Lorie, R ;
Mao, J ;
Mohiuddin, M ;
Sorin, A ;
Yair, E .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1996, 40 (02) :211-230
[8]  
Gorski N., 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318), P523, DOI 10.1109/ICDAR.1999.791840
[9]   Recognition of legal amounts on bank cheques [J].
Guillevic, D ;
Suen, CY .
PATTERN ANALYSIS AND APPLICATIONS, 1998, 1 (01) :28-41
[10]   A lexicon driven approach to handwritten word recognition for real-time applications [J].
Kim, G ;
Govindaraju, V .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (04) :366-379