Automatic document processing: A survey

被引:81
作者
Tang, YY
Lee, SW
Suen, CY
机构
[1] KOREA UNIV, DEPT COMP SCI, SEONGBUK KU, SEOUL 136701, SOUTH KOREA
[2] CONCORDIA UNIV, CTR PATTERN RECOGNIT & MACHINE INTELLIGENCE, MONTREAL, PQ H3G 1M8, CANADA
关键词
document processing; document analysis and understanding; geometric and logical structures; hierarchical and no-hierarchical methods; tree transform; formatting knowledge; description languages; texture analysis;
D O I
10.1016/S0031-3203(96)00044-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Surveys of the basic concepts and underlying techniques are presented in this paper. A basic model for document processing is described. In this model, document processing can be divided into two phases: document analysis and document understanding. A document has two structures: geometric (layout) structure and logical structure. Extraction of the geometric structure from a document refers to document analysis; mapping the geometric structure into logical structure deals with document understanding. Both types of document structures and the two areas of document processing are discussed. Two categories of methods have been used in document analysis, namely, (1) hierarchical methods including top-down and bottom-up approaches, (2) no-hierarchical methods including modified fractal signature. Tree transform, formatting knowledge and description language approaches have been used in document understanding. A particular case of form document processing is discussed. Form description and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc, which have been used in these approaches are discussed. Copyright (C) 1996 Pattern Recognition Society.
引用
收藏
页码:1931 / 1952
页数:22
相关论文
共 105 条
[81]   AUTOMATIC RECOGNITION OF HANDPRINTED CHARACTERS - THE STATE OF THE ART [J].
SUEN, CY ;
BERTHOD, M ;
MORI, S .
PROCEEDINGS OF THE IEEE, 1980, 68 (04) :469-487
[82]  
SUEN CY, 1989, DOCUMENT LAYOUT LOGI
[83]  
Tang Y. Y., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P567, DOI 10.1109/ICDAR.1995.601960
[84]  
TANG YY, 1994, IEEE T KNOWL DATA EN, V6, P3, DOI 10.1109/69.273022
[85]  
TANG YY, 1991, 1ST P INT C DOC AN R, P17
[86]  
TANG YY, 1991, P INT C COMP PROC CH, P313
[87]  
TANG YY, 1993, HDB PATTERN RECOGNIT, P625
[88]  
TANG YY, 1990, CENPAR2 CONC U
[89]  
Taylor S. L., 1992, Machine Vision and Applications, V5, P211, DOI 10.1007/BF02626999
[90]  
TOYODA J, 1982, 6TH P INT C PATT REC, P1113