Artificial neural networks for document analysis and recognition

Cited by: 94
Authors
Marinai, S
Gori, M
Soda, G
Affiliations
[1] Univ Florence, Dipartimento Sistemi & Informat, I-50139 Florence, Italy
[2] Univ Siena, Dipartimento Ingn Informaz, I-53100 Siena, Italy
Keywords
character segmentation; document image analysis and recognition; layout analysis; neural networks; preprocessing; recursive neural networks; word recognition
DOI
10.1109/TPAMI.2005.4
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Artificial neural networks have been extensively applied to document analysis and recognition. Most efforts have been devoted to the recognition of isolated handwritten and printed characters, with widely recognized success. However, many other document processing tasks, such as preprocessing, layout analysis, character segmentation, word recognition, and signature verification, have also been addressed effectively, with very promising results. This paper surveys the most significant problems in offline document image processing to which connectionist-based approaches have been applied. Similarities and differences between approaches belonging to different categories are discussed. Particular emphasis is placed on the crucial role of prior knowledge in the design of both appropriate architectures and learning algorithms. Finally, the paper provides a critical analysis of the reviewed approaches and outlines the most promising research directions in the field. In particular, a second generation of connectionist-based models is foreseen, based on appropriate graphical representations of the learning environment.
Pages: 23-35
Number of pages: 13
Related papers
102 in total
[1]   A NEURAL-NETWORK-BASED DEDICATED THINNING METHOD [J].
AHMED, P .
PATTERN RECOGNITION LETTERS, 1995, 16 (06) :585-590
[2]   Hand-printed Arabic character recognition system using an artificial network [J].
Amin, A ;
AlSadoun, H ;
Fischer, S .
PATTERN RECOGNITION, 1996, 29 (04) :663-675
[3]  
[Anonymous], P 7 INT C PATT REC M
[4]   Segmentation of touching characters using an MLP [J].
Bae, JH ;
Jung, KC ;
Kim, JW ;
Kim, HJ .
PATTERN RECOGNITION LETTERS, 1998, 19 (08) :701-709
[5]   LEARNING IN MULTILAYERED NETWORKS USED AS AUTOASSOCIATORS [J].
BIANCHINI, M ;
FRASCONI, P ;
GORI, M .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (02) :512-515
[6]   IMPROVING REJECTION PERFORMANCE ON HANDWRITTEN DIGITS BY TRAINING WITH RUBBISH [J].
BROMLEY, J ;
DENKER, JS .
NEURAL COMPUTATION, 1993, 5 (03) :367-370
[7]   EXPERIMENTS ON NEURAL NET RECOGNITION OF SPOKEN AND WRITTEN TEXT [J].
BURR, DJ .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (07) :1162-1168
[8]  
CAELLI TM, 2003, P PRASA 2003 LANG S, P1
[9]  
Cardot H., 1994, International Journal of Pattern Recognition and Artificial Intelligence, V8, P679, DOI 10.1142/S021800149400036X
[10]   FUZZY ARTMAP - A NEURAL NETWORK ARCHITECTURE FOR INCREMENTAL SUPERVISED LEARNING OF ANALOG MULTIDIMENSIONAL MAPS [J].
CARPENTER, GA ;
GROSSBERG, S ;
MARKUZON, N ;
REYNOLDS, JH ;
ROSEN, DB .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05) :698-713