A robust and fast skew detection algorithm for generic documents

被引:87
作者
Yu, B [1 ]
Jain, AK [1 ]
机构
[1] MICHIGAN STATE UNIV, DEPT COMP SCI, E LANSING, MI 48824 USA
关键词
skew detection; document image processing; hierarchical Hough transform; block adjacency graph; connected components; HOUGH TRANSFORM;
D O I
10.1016/0031-3203(96)00020-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A robust and fast skew detection algorithm based on hierarchical Hough transform is proposed. It is capable of detecting the skew angle for various document images, including technical articles, postal labels, handwritten text, forms, drawings and bar codes. The algorithm is robust even when black margins introduced by photocopying are present in the image and when the document is scanned at a low resolution of 50 dpi. The algorithm consists of two steps. In the first step we quickly extract the centroids of connected components using a graph data structure. Then, a hierarchical Hough transform (at two different angular resolutions) is applied to the selected centroids. The skew angle corresponds to the location of the highest peak in the Hough space. The performance of the algorithm is shown on a number of document images collected from various application domains. The algorithm is not very sensitive to algorithmic parameters. For an A4 size document image scanned at 50 dpi (typically 413 x 575 pixels), our algorithm is able to detect the skew angle with an accuracy of 0.1 degrees in 0.4s of CPU time on a SunSparc 20 workstation. Copyright (C) 1996 Pattern Recognition Society.
引用
收藏
页码:1599 / 1629
页数:31
相关论文
共 21 条
[1]   AUTOMATED ENTRY SYSTEM FOR PRINTED DOCUMENTS [J].
AKIYAMA, T ;
HAGITA, N .
PATTERN RECOGNITION, 1990, 23 (11) :1141-1154
[2]  
[Anonymous], P 3 IAPR INT C DOC A
[3]  
BAIRD HS, 1987, 40TH P SPSE C S HYBR, P21
[4]   GENERALIZING THE HOUGH TRANSFORM TO DETECT ARBITRARY SHAPES [J].
BALLARD, DH .
PATTERN RECOGNITION, 1981, 13 (02) :111-122
[5]  
CHEN S, 1994, P IEEE INT C IM PROC, P139
[6]   A METHOD OF DETECTING THE ORIENTATION OF ALIGNED COMPONENTS [J].
HASHIZUME, A ;
YEH, PS ;
ROSENFELD, A .
PATTERN RECOGNITION LETTERS, 1986, 4 (02) :125-132
[7]  
HINDS S, 1990, 10TH P INT C PATT RE, P464
[8]   A SURVEY OF THE HOUGH TRANSFORM [J].
ILLINGWORTH, J ;
KITTLER, J .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1988, 44 (01) :87-116
[9]  
Ishitani Y., 1993, Proceedings of the Second International Conference on Document Analysis and Recognition (Cat. No.93TH0578-5), P49, DOI 10.1109/ICDAR.1993.395784
[10]  
LIU J, 1992, 11TH IAPR INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, PROCEEDINGS, VOL III, P122, DOI 10.1109/ICPR.1992.201942