KEYWORD SPOTTING IN POORLY PRINTED DOCUMENTS USING PSEUDO-2D HIDDEN MARKOV-MODELS

Cited by: 146
Authors
KUO, SS [1]
AGAZZI, OE [1]
Affiliations
[1] AT&T BELL LABS, SIGNAL PROC RES DEPT, MURRAY HILL, NJ 07974
Keywords
OPTICAL CHARACTER RECOGNITION; KEYWORD RECOGNITION; 2-D HIDDEN MARKOV MODEL; DYNAMIC PROGRAMMING;
DOI
10.1109/34.308482
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
An algorithm for robust machine recognition of keywords embedded in a poorly printed document is presented. For each keyword, two statistical models, named pseudo 2-D Hidden Markov Models, are created: one represents the actual keyword and the other represents all other extraneous words. Dynamic programming is then used to match an unknown input word against the two models and to make a maximum likelihood decision. Although the models are pseudo 2-D in the sense that they are not fully connected 2-D networks, they are shown to be general enough to characterize printed words efficiently. These models provide an "elastic matching" property in both the horizontal and vertical directions, which makes the recognizer not only independent of size and slant but also tolerant of highly deformed and noisy words. The system is evaluated on a synthetically created database that contains about 26,000 words. Currently, we achieve a recognition accuracy of 99% when words in the testing and training sets are of the same font size, and 96% when they are of different sizes. In the latter case, the conventional 1-D HMM achieves only a 70% accuracy rate.
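The matching step described in the abstract can be illustrated with a minimal sketch. It assumes a binary word image, Bernoulli pixel emissions, and one shared self-loop probability, none of which are specified in the abstract. All names (`viterbi_left_to_right`, `p2dhmm_log_likelihood`, `is_keyword`, the toy models) are hypothetical and are not taken from the paper. Each superstate owns a vertical left-to-right HMM that scores one image column; a second left-to-right pass then aligns the columns to the superstates, which is what makes the model pseudo 2-D.

```python
import numpy as np

NEG_INF = -1e30  # finite stand-in for log(0)

def viterbi_left_to_right(log_emit, log_stay, log_next):
    """Best-path log-score of a left-to-right chain.

    log_emit[t, s] is the log-score of observation t in state s.
    The path starts in state 0 and must end in the last state.
    """
    score = np.full(log_emit.shape[1], NEG_INF)
    score[0] = log_emit[0, 0]
    for t in range(1, log_emit.shape[0]):
        stay = score + log_stay
        move = np.concatenate(([NEG_INF], score[:-1] + log_next))
        score = np.maximum(stay, move) + log_emit[t]
    return score[-1]

def p2dhmm_log_likelihood(image, superstates, p_stay=0.5):
    """Score a 0/1 image (rows x columns) against a pseudo 2-D model.

    superstates[k][j] is the assumed probability that a pixel is black
    in vertical state j of superstate k (a toy Bernoulli emission).
    p_stay is shared by every state, which is a simplification.
    """
    log_stay, log_next = np.log(p_stay), np.log(1.0 - p_stay)
    cols = image.shape[1]
    col_scores = np.empty((cols, len(superstates)))
    for k, p_black in enumerate(superstates):
        p = np.clip(np.asarray(p_black, dtype=float), 1e-6, 1 - 1e-6)
        for c in range(cols):
            pix = image[:, c][:, None]  # rows x 1
            log_emit = pix * np.log(p) + (1 - pix) * np.log(1 - p)
            # Vertical pass: align the rows of this column to the states.
            col_scores[c, k] = viterbi_left_to_right(log_emit, log_stay, log_next)
    # Horizontal pass: align the columns to the superstates.
    return viterbi_left_to_right(col_scores, log_stay, log_next)

def is_keyword(image, keyword_model, filler_model):
    """Accept the word if the keyword model scores higher than the filler."""
    return bool(p2dhmm_log_likelihood(image, keyword_model)
                > p2dhmm_log_likelihood(image, filler_model))

# Toy models (made-up parameters): blank, a horizontal stroke, blank.
keyword_model = [np.array([0.05]), np.array([0.05, 0.95, 0.05]), np.array([0.05])]
filler_model = [np.array([0.5])]  # one superstate that accepts anything

clean = np.zeros((6, 6), dtype=int)
clean[2:4, 2:4] = 1                           # small stroke in the middle
noisy = np.indices((6, 6)).sum(axis=0) % 2    # checkerboard pattern

print(is_keyword(clean, keyword_model, filler_model))
print(is_keyword(noisy, keyword_model, filler_model))
```

Because both passes allow each state to absorb any number of rows or columns, the same toy model matches the stroke at other heights and widths, which is one way to read the "elastic matching" property the abstract mentions. A real system would estimate the emission and transition parameters from training data rather than fix them by hand.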
Pages: 842-848
Number of pages: 7