PSEUDO 2-DIMENSIONAL HIDDEN MARKOV-MODELS FOR DOCUMENT RECOGNITION

被引:10
作者
AGAZZI, OE [1 ]
KUO, SS [1 ]
机构
[1] AT&T IMAGE SOLUT,SOMERSET,NJ
来源
AT&T TECHNICAL JOURNAL | 1993年 / 72卷 / 05期
关键词
Algorithms;
D O I
10.1002/j.1538-7305.1993.tb00655.x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Hidden Markov models (HMM) have become the most popular technique for automatic speech recognition. Extending this technique to the two-dimensional domain is a promising approach to solving difficult problems in optical character recognition (OCR), such as recognizing poorly printed text. Hidden Markov models are robust for OCR applications due to: Their inherent tolerance to noise and distortion, Their ability to segment blurred and connected text into words and characters as an integral part of the recognition process, Their invariance to size, slant, and other transformations of the basic characters, and The ease with which contextual information and language models can be incorporated into the recognition algorithms. We give a brief overview of OCR algorithms based on two-dimensional hidden Markov models, and we present three case studies that show their remarkable strengths.
引用
收藏
页码:60 / 72
页数:13
相关论文
共 18 条
[1]   ANATOMY OF A VERSATILE PAGE READER [J].
BAIRD, HS .
PROCEEDINGS OF THE IEEE, 1992, 80 (07) :1059-1065
[2]   OMNIDOCUMENT TECHNOLOGIES [J].
BOKSER, M .
PROCEEDINGS OF THE IEEE, 1992, 80 (07) :1066-1078
[3]  
BOSE C, 1992, 11TH P INT C PATT RE
[4]  
CASEY RG, 1982, 6TH INT C P R MUN, P1023
[5]  
CHOU P, 1989, NOV SPIE C VIS COMM
[6]   CHARACTER-RECOGNITION - A REVIEW [J].
GOVINDAN, VK ;
SHIVAPRASAD, AP .
PATTERN RECOGNITION, 1990, 23 (07) :671-683
[7]  
Hopcroft J. E., 1979, INTRO AUTOMATA THEOR
[8]   ON THE RECOGNITION OF PRINTED CHARACTERS OF ANY FONT AND SIZE [J].
KAHAN, S ;
PAVLIDIS, T ;
BAIRD, HS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (02) :274-288
[9]  
KUO S, 1993, JUN P INT C COMP VIS
[10]  
KUO S, 1993, 1993 INT C AC SPEECH