COMBINATION OF HMMS FOR THE REPRESENTATION OF PRINTED CHARACTERS IN NOISY DOCUMENT IMAGES

被引:7
作者
ELMS, AJ
ILLINGWORTH, J
机构
[1] Department of Electronic and Electrical Engineering, University of Surrey, Guildford
关键词
CHARACTER RECOGNITION; HIDDEN MARKOV MODELS; SHALLOW CONTEXTUAL KNOWLEDGE;
D O I
10.1016/0262-8856(95)99725-G
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many methods of printed character recognition have been proposed to-date, but although performance figures are usually stated for a particular set of fonts or size of text, it is rarely clear under what conditions of noise the measurements were taken. Baird has suggested a model of Document Imaging Defects, which enables authors to compare results against an emerging standard where one figure can be quoted to quantify the level of noise present in the document image. In this paper, a novel method is proposed for the recognition of printed characters, and its extension to the segmentation and recognition of noisy printed words is outlined. The method is based on the representation of the shape of a character by two Hidden Markov Models. Recognition is achieved by scoring these models against the test pattern and combining the results. The method has been evaluated using Baird's noise model, producing a peak performance of 99.5% on the test set in the presence of near-minimal noise. The method generalizes to recognize characters with noise levels greater than those included in the training set, and an investigation of the top-k performance suggests that much of the effect of noise on the recognition performance on images of natural language text could be overcome using a word recognizer employing shallow contextual knowledge.
引用
收藏
页码:385 / 392
页数:8
相关论文
共 20 条
[1]  
AGAZZI O, 1993, 2ND P INT C DOC AN R, P155
[2]   HIDDEN MARKOV MODEL-BASED OPTICAL CHARACTER-RECOGNITION IN THE PRESENCE OF DETERMINISTIC TRANSFORMATIONS [J].
AGAZZI, OE ;
KUO, SS .
PATTERN RECOGNITION, 1993, 26 (12) :1813-1826
[3]  
ANIGBOGU A, 1991, 1ST P INT C DOC AN R, P785
[4]  
ANIGBOGU JC, 1992, 11TH IAPR INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, PROCEEDINGS, VOL II, P565, DOI 10.1109/ICPR.1992.201697
[5]  
BAIRD H, 1991, 1ST P INT C DOC AN R, P332
[6]  
BELAID A, 1991, 5TH P INT S APPL STO, P87
[7]   OMNIDOCUMENT TECHNOLOGIES [J].
BOKSER, M .
PROCEEDINGS OF THE IEEE, 1992, 80 (07) :1066-1078
[8]  
BOSE CB, 1992, 11TH IAPR INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, PROCEEDINGS, VOL II, P116, DOI 10.1109/ICPR.1992.201734
[9]  
Bunke H, 2012, STRUCTURED DOCUMENT
[10]  
CHEN FR, 1993, 2ND P INT C DOC AN R, P133