A SUBSPACE PROJECTION APPROACH TO FEATURE-EXTRACTION - THE 2-DIMENSIONAL GABOR TRANSFORM FOR CHARACTER-RECOGNITION

被引:18
作者
SHUSTOROVICH, A
机构
关键词
NEURAL NETWORKS; CHARACTER RECOGNITION; GABOR WAVELETS; FEATURE EXTRACTION; SUBSPACE PROJECTION; IMAGE RECONSTRUCTION; WEIGHT FUNCTIONS;
D O I
10.1016/0893-6080(94)90010-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an application of the two-dimensional Gabor wavelets as feature extractors for character recognition with neural networks. Our approach is based on an analysis of the function performed by a single hidden unit in the first layer of a network presented with raw pixel data. This weight function can be approximated by a linear combination of basis functions from a fixed set. We establish the duality between this expansion and feature extraction: the projections of an image onto the same basis set play the role of precalculated features, and they are used as the input to the network. Recognizability of images reconstructed from these projections suggests that the necessary information is preserved by the corresponding feature extraction scheme. In this study, the Gabor wavelets provided the best trade-off between dimensionality reduction and quality of the reconstructed images. A local receptive field (LRF) network was trained on the NIST data base of isolated alphanumeric characters and tested on unseen parts of the same data base. The use of Gabor projections instead of original pixel data resulted in improvement from 86.35% to 89.40% for the lowercase, from 89.40% to 96.44% for the uppercase, and from 98.63% to 99.11% for digits, which corresponds to 22-66% reduction of classification error. This LRF-Gabor network became a part of a unified algorithm used by Eastman Kodak Company that finished in the tight group of leaders at the U.S. Census Bureau/NIST First OCR Systems Competition.
引用
收藏
页码:1295 / 1301
页数:7
相关论文
共 12 条
[1]  
[Anonymous], 1990, ADV NEURAL INF PROCE
[2]   COMPLETE DISCRETE 2-D GABOR TRANSFORMS BY NEURAL NETWORKS FOR IMAGE-ANALYSIS AND COMPRESSION [J].
DAUGMAN, JG .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (07) :1169-1179
[3]   THE DESIGN AND USE OF STEERABLE FILTERS [J].
FREEMAN, WT ;
ADELSON, EH .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1991, 13 (09) :891-906
[4]  
GARRIS MD, 1991, P INT JOINT C NEURAL
[5]  
Hinton G.E., 1986, P 8 ANN C COGN SCI S, V1, P12
[6]  
KANDEL ER, 1985, PRINCIPLES NEURAL SC, P366
[7]  
LeCun Y., 1989, P ADV NEURAL INFORM, P396
[8]  
LeCun Y., 1990, PROC ADV NEURAL INFO, P598, DOI DOI 10.5555/109230.109298
[9]   FEATURE-EXTRACTION BASED ON DECISION BOUNDARIES [J].
LEE, CH ;
LANDGREBE, DA .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (04) :388-400
[10]  
PAWLICKI TF, 1991, COMMUNICATION