Object recognition with gradient-based learning

被引:610
作者
LeCun, Y [1 ]
Haffner, P [1 ]
Bottou, L [1 ]
Bengio, Y [1 ]
机构
[1] AT&T Shannon Lab, Red Bank, NJ 07701 USA
来源
SHAPE, CONTOUR AND GROUPING IN COMPUTER VISION | 1999年 / 1681卷
关键词
D O I
10.1007/3-540-46805-6_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding an appropriate set of features is an essential problem in the design of shape recognition systems. This paper attempts to show that for recognizing simple objects with high shape variability such as handwritten characters, it is possible, and even advantageous, to feed the system directly with minimally processed images and to rely on learning to extract the right set of features. Convolutional Neural Networks are shown to be particularly well suited to this task. We also show that these networks can be used to recognize multiple objects without requiring explicit segmentation of the objects from their surrounding. The second part of the paper presents the Graph Transformer Network model which extends the applicability of gradient-based learning to systems that use graphs to represents features, objects, and their combinations.
引用
收藏
页码:319 / 345
页数:27
相关论文
共 33 条
[1]  
[Anonymous], 1993, ADV NEURAL INF PROCE
[2]  
Bengio Y., 1995, NEURAL COMPUTATION, V7
[3]  
BOTTOU L, 1991, ADV NEURAL INFORMATI, V3
[4]  
Bottou L., 1997, P COMP VIS PATT REC
[5]  
BURGES C, 1997, ADV NEURAL INFORMATI, V9
[6]  
DRIANCOURT X, 1991, P INT JOINT C NEUR N
[7]   COGNITRON - SELF-ORGANIZING MULTILAYERED NEURAL NETWORK [J].
FUKUSHIMA, K .
BIOLOGICAL CYBERNETICS, 1975, 20 (3-4) :121-136
[8]   NEOCOGNITRON - A NEW ALGORITHM FOR PATTERN-RECOGNITION TOLERANT OF DEFORMATIONS AND SHIFTS IN POSITION [J].
FUKUSHIMA, K ;
MIYAKE, S .
PATTERN RECOGNITION, 1982, 15 (06) :455-469
[9]   RECEPTIVE FIELDS, BINOCULAR INTERACTION AND FUNCTIONAL ARCHITECTURE IN CATS VISUAL CORTEX [J].
HUBEL, DH ;
WIESEL, TN .
JOURNAL OF PHYSIOLOGY-LONDON, 1962, 160 (01) :106-&
[10]  
KEELER J, 1991, ADV NEURAL INFORMATI, V3, P557