A comparison of system architectures for intelligent document understanding

被引:8
作者
Farrow, GSD [1 ]
Xydeas, CS [1 ]
Oakley, JP [1 ]
Khorabi, A [1 ]
Prelcic, NG [1 ]
机构
[1] UNIV MANCHESTER,SCH ENGN,DIV ELECT ENGN,MULTIMEDIA RES LAB,MANCHESTER M13 9PL,LANCS,ENGLAND
关键词
document understanding; page layout analysis; document image processing; ODA;
D O I
10.1016/S0923-5965(96)00002-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Intelligent document understanding (IDU) is the process of converting scanned document images into a high level representation which describes the document's layout and logical structure, in addition to providing its information content. In this paper we discuss IDU in general and address a specific problem within this domain concerning the extraction of the layout structure of pages from a technical journal. Three different architectural approaches to accomplishing this task are proposed. Firstly we describe a novel document understanding system (System A) which exploits a hybrid bottom-up/top-down control architecture. The system uses a variety of image processing algorithms in a bottom-up manner. Conversely, a system based on a pure top-down architecture (System B) is then proposed which produces a segmentation of the page via projection profile analysis and achieves classification of image regions via procedural deduction. Finally, an alternative top-down architecture (System C) is described in which an optimised segmentation scheme is applied to produce partitioned blocks. These are then classified in a goal driven manner using a decision tree. A comparison of the three systems is made by measuring system performance on images obtained from a specific class of input document. The performance of document understanding systems has been quantified in terms of an object identification rate and the percentage of column area successfully interpreted. Using these measures, System A has given superior results to the two top-down systems presented. System A also performs significantly better than a previously reported top-down system operating on a comparable problem (Viswanathan, 1990).
引用
收藏
页码:1 / 19
页数:19
相关论文
共 12 条
[1]  
Ballard D.H., 1982, Computer Vision
[2]   SYSTEM FOR AN INTELLIGENT OFFICE DOCUMENT ANALYSIS, RECOGNITION AND DESCRIPTION [J].
CHAUVET, P ;
LOPEZKRAHE, J ;
TAFLIN, E ;
MAITRE, H .
SIGNAL PROCESSING, 1993, 32 (1-2) :161-190
[3]  
DENGEL A, 1990, P IAPR WORKSH SYNT S, P70
[4]  
FARROW GSD, 1990, P EUSIPCO 90 BARC
[5]  
HART PE, 1968, IEEE T SSC, V4
[6]  
KHORABI A, 1994, THESIS MANCHESTER U
[7]   A PROTOTYPE DOCUMENT IMAGE-ANALYSIS SYSTEM FOR TECHNICAL JOURNALS [J].
NAGY, G ;
SETH, S ;
VISWANATHAN, M .
COMPUTER, 1992, 25 (07) :10-22
[8]  
Nagy G., 1984, Seventh International Conference on Pattern Recognition (Cat. No. 84CH2046-1), P347
[9]  
*ODA, 1985, ECMA101 ODA
[10]  
STORY G, 1992, IEEE COMPUT, V25, P17