Computation of pattern invariance in brain-like structures

被引:37
作者
Ullman, S [1 ]
Soloviev, S
机构
[1] Weizmann Inst Sci, Dept Appl Math & Comp Sci, IL-76100 Rehovot, Israel
[2] Weizmann Inst Sci, Dept Neurobiol, IL-76100 Rehovot, Israel
关键词
shift invariance; pattern invariance; object recognition; visual system;
D O I
10.1016/S0893-6080(99)00048-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fundamental capacity of the perceptual systems and the brain in general is to deal with the novel and the unexpected. In vision, we can effortlessly recognize a familiar object under novel viewing conditions, or recognize a new object as a member of a familiar class, such as a house, a face, or a car. This ability to generalize and deal efficiently with novel stimuli has long been considered a challenging example of brain-like computation that proved extremely difficult to replicate in artificial systems. In this paper we present an approach to generalization and invariant recognition. We focus our discussion on the problem of invariance to position in the visual field, but also sketch how similar principles could apply to other domains. The approach is based on the use of a large repertoire of partial generalizations that are built upon past experience. In the case of shift invariance, visual patterns are described as the conjunction of multiple overlapping image fragments. The invariance to the more primitive fragments is built into the system by past experience. Shift invariance of complex shapes is obtained from the invariance of their constituent fragments. We study by simulations aspects of this shift invariance method and then consider its extensions to invariant perception and classification by brain-like structures. (C) 1999 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1021 / 1036
页数:16
相关论文
共 36 条
[1]  
[Anonymous], 1996, HIGH LEVEL VISION OB
[2]   HUMAN IMAGE UNDERSTANDING - RECENT RESEARCH AND A THEORY [J].
BIEDERMAN, I .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 32 (01) :29-73
[3]   EVIDENCE FOR COMPLETE TRANSLATIONAL AND REFLECTIONAL INVARIANCE IN VISUAL OBJECT PRIMING [J].
BIEDERMAN, I ;
COOPER, EE .
PERCEPTION, 1991, 20 (05) :585-593
[4]  
BRICOLO E, 1993, PERCEPTION S, V22, P105
[5]  
BRICOLO E, 1992, PERCEPTION S2, V21, P59
[6]  
DESIMONE R, 1984, J NEUROSCI, V4, P2051
[7]   The role of visual field position in pattern-discrimination learning [J].
Dill, M ;
Fahle, M .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1997, 264 (1384) :1031-1036
[8]  
DILL M, 1997, 1610 AI MIT
[9]   SIZE INVARIANCE IN VISUAL OBJECT PRIMING OF GRAY-SCALE IMAGES [J].
FISER, J ;
BIEDERMAN, I .
PERCEPTION, 1995, 24 (07) :741-748
[10]   Learning Invariance from Transformation Sequences [J].
Foldiak, Peter .
NEURAL COMPUTATION, 1991, 3 (02) :194-200