CHARACTER-RECOGNITION WITHOUT SEGMENTATION

被引:36
作者
ROCHA, J [1 ]
PAVLIDIS, T [1 ]
机构
[1] SUNY STONY BROOK,DEPT COMP SCI,STONY BROOK,NY 11794
基金
美国国家科学基金会;
关键词
CHARACTER RECOGNITION WITHOUT SEGMENTATION; BROKEN CHARACTER RECOGNITION; TOUCHING CHARACTER RECOGNITION; HOMEOMORPHIC SUBGRAPH MATCHING; RELATIVE NEIGHBORHOOD GRAPH;
D O I
10.1109/34.406657
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A segmentation-free approach to OCR is presented as part of a knowledge based word interpretation model. This new method is based on the recognition of subgraphs homeomorphic to previously defined prototypes of characters [16]. Gaps are. identified as potential parts of characters by implementing a variant of the notion of relative neighborhood used in computational perception. In the system, each subgraph of strokes that matches a previously defined character prototype is recognized anywhere in the word even if it corresponds to a broken character or to a character touching another one. The characters are detected in the order defined by the matching quality. Each subgraph:that is recognized is introduced as a node in a directed net that compiles different alternatives of interpretation of the features in the feature graph. A path in the net represents a consistent succession of characters in the word. The method allows the recognition of characters that overlap or that are underlined. A final search for the optimal path under certain criteria gives the best interpretation of the word features. The character recognized uses a flexible matching between the features and a flexible grouping of the individual features to be matched. Broken characters are recognized by looking for gaps between features that may he interpreted as part of a character. Touching characters are recognized because the matching allows nonmatched adjacent strokes. The recognition results of this system for over 24,000 printed numeral characters belonging to a USPS database and on some hand-printed words confirmed the method's high robustness level.
引用
收藏
页码:903 / 909
页数:7
相关论文
共 23 条
[11]  
LECOLINET E, 1991, 1ST P INT C DOC AN R, P740
[12]  
Lowe D. G., 1985, PERCEPTUAL ORG VISUA
[13]  
OHMORI K, 1993, 3RD P INT WORKSH FRO, P242
[14]  
Okamoto M., 1991, INT C DOC AN REC, P242
[15]  
Radke JD, 1988, SHAPE SET POINTS, P105
[16]   A SHAPE-ANALYSIS MODEL WITH APPLICATIONS TO A CHARACTER-RECOGNITION SYSTEM [J].
ROCHA, J ;
PAVLIDIS, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (04) :393-404
[17]  
SAKODA WJ, 1993, FEB SPIE S EL IM TEC, P21
[18]   DOCUMENT ANALYSIS - FROM PIXELS TO CONTENTS [J].
SCHURMANN, J ;
BARTNECK, N ;
BAYER, T ;
FRANKE, J ;
MANDLER, E ;
OBERLANDER, M .
PROCEEDINGS OF THE IEEE, 1992, 80 (07) :1101-1119
[19]   THE RELATIVE NEIGHBORHOOD GRAPH, WITH AN APPLICATION TO MINIMUM SPANNING-TREES [J].
SUPOWIT, KJ .
JOURNAL OF THE ACM, 1983, 30 (03) :428-448
[20]  
TORIWAKI J, 1988, COMPUTATIONAL MORPHO, P207