BINARIZATION AND MULTITHRESHOLDING OF DOCUMENT IMAGES USING CONNECTIVITY

被引:74
作者
OGORMAN, L
机构
[1] AT and T Bell Labs, Murray Hill
来源
CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING | 1994年 / 56卷 / 06期
关键词
D O I
10.1006/cgip.1994.1044
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Thresholding is a common image processing operation applied to gray-scale images to obtain binary or multilevel images. Traditionally, one of two approaches is used: global or locally adaptive processing. However, each of these approaches has a disadvantage: the global approach neglects local information, and the locally adaptive approach neglects global information. A thresholding method is described here that is global in approach, but uses a measure of local information, namely connectivity. Thresholds are found at the intensity levels that best preserve the connectivity of regions within the image. Thus, this method has advantages of both global and locally adaptive approaches. This method is applied here to document images. Experimental comparisons against other thresholding methods show that the connectivity-preserving method yields much improved results. On binary images, this method has been shown to improve subsequent OCR recognition rates from about 958 to 97.5%. More importantly, the new method has been shown to reduce the number of binarization failures ( where text is so poorly binarized as to be totally unrecognizable by a commercial OCR system) from 33%: to 68 on difficult images. For multilevel document images, as well, the results shown similar improvement. (C) 1994 Academic Press, Inc.
引用
收藏
页码:494 / 506
页数:13
相关论文
共 12 条
[1]  
FU SK, 1981, PATTERN RECOGN, V13, P3
[2]  
JOHANNSEN G, 1982, 6TH P INT C PATT REC, P140
[3]  
KAMEL M, 1993, CVGIP-GRAPH MODEL IM, V55, P203, DOI 10.1006/cgip.1993.1015
[4]   A NEW METHOD FOR GRAY-LEVEL PICTURE THRESHOLDING USING THE ENTROPY OF THE HISTOGRAM [J].
KAPUR, JN ;
SAHOO, PK ;
WONG, AKC .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 29 (03) :273-285
[5]   MINIMUM ERROR THRESHOLDING [J].
KITTLER, J ;
ILLINGWORTH, J .
PATTERN RECOGNITION, 1986, 19 (01) :41-47
[6]  
OGORMAN L, 1992, SEP INT C PATT REC I, P280
[7]   THRESHOLD SELECTION METHOD FROM GRAY-LEVEL HISTOGRAMS [J].
OTSU, N .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1979, 9 (01) :62-66
[8]   ENTROPIC THRESHOLDING, A NEW APPROACH [J].
PUN, T .
COMPUTER GRAPHICS AND IMAGE PROCESSING, 1981, 16 (03) :210-239
[9]   A SURVEY OF THRESHOLDING TECHNIQUES [J].
SAHOO, PK ;
SOLTANI, S ;
WONG, AKC ;
CHEN, YC .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1988, 41 (02) :233-260
[10]  
STORY GA, 1992, IEEE COMPUT, P17