INFORMys: A flexible invoice-like form-reader system

Cited by: 52
Authors
Cesarini, F
Gori, M
Marinai, S
Soda, G
Affiliations
[1] Univ Florence, Dipartimento Sistemi & Informat, I-50138 Firenze, Italy
[2] Univ Siena, Dipartimento Ingn Informaz, I-53100 Siena, Italy
Keywords
attributed relational graphs; document analysis and recognition; document registration; invoice processing; location of information fields;
DOI
10.1109/34.689303
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we describe a flexible form-reader system capable of extracting textual information from accounting documents, like invoices and bills of service companies. In this kind of document, the extraction of some information fields cannot take place without having detected the corresponding instruction fields, which are only constrained to range in given domains. We propose modeling the document's layout by means of attributed relational graphs, which turn out to be very effective for form registration, as well as for performing a focussed search for instruction fields. This search is carried out by means of a hybrid model, where proper algorithms, based on morphological operations and connected components, are integrated with connectionist models. Experimental results are given in order to assess the actual performance of the system.
Pages: 730-745
Number of pages: 16