Category learning through multimodality sensing

被引:54
作者
de Sa, VR [1 ]
Ballard, DH
机构
[1] Univ Calif San Francisco, Sloan Ctr Theoret Neurobiol, San Francisco, CA 94143 USA
[2] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
关键词
D O I
10.1162/089976698300017368
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans and other animals learn to form complex categories without receiving a target output, or teaching signal, with each input pattern. In contrast, most computer algorithms that emulate such performance assume the brain is provided with the correct output at the neuronal level or require grossly unphysiological methods of information propagation. Natural environments do not contain explicit labeling signals, but they do contain important information in the form of temporal correlations between sensations to different sensory modalities, and humans are affected by this correlational structure (Howells, 1944; McGurk & MacDonald, 1976; MacDonald & McGurk, 1978; Zellner & Kautz, 1990; Durgin & Proffitt, 1996). In this article we describe a simple, unsupervised neural network algorithm that also uses this natural structure. Using only the co-occurring patterns of lip motion and sound signals from a human speaker, the network learns separate visual and auditory speech classifiers that perform comparably to supervised networks.
引用
收藏
页码:1097 / 1117
页数:21
相关论文
共 40 条
[1]   LONG-TERM DEPRESSION OF EXCITATORY SYNAPTIC TRANSMISSION AND ITS RELATIONSHIP TO LONG-TERM POTENTIATION [J].
ARTOLA, A ;
SINGER, W .
TRENDS IN NEUROSCIENCES, 1993, 16 (11) :480-487
[2]   SELF-ORGANIZING NEURAL NETWORK THAT DISCOVERS SURFACES IN RANDOM-DOT STEREOGRAMS [J].
BECKER, S ;
HINTON, GE .
NATURE, 1992, 355 (6356) :161-163
[4]   REPONSES SOMESTHESIQUES, VISUELLES ET AUDITIVES, RECUEILLIES AU NIVEAU DU CORTEX ASSOCIATIF SUPRASYLVIEN CHEZ LE CHAT CURARISE NON ANESTHESIE [J].
BUSER, P ;
BORENSTEIN, P .
ELECTROENCEPHALOGRAPHY AND CLINICAL NEUROPHYSIOLOGY, 1959, 11 (02) :285-304
[5]   ARTMAP - SUPERVISED REAL-TIME LEARNING AND CLASSIFICATION OF NONSTATIONARY DATA BY A SELF-ORGANIZING NEURAL NETWORK [J].
CARPENTER, GA ;
GROSSBERG, S ;
REYNOLDS, JH .
NEURAL NETWORKS, 1991, 4 (05) :565-588
[6]   A CORTICAL MODEL OF WINNER-TAKE-ALL COMPETITION VIA LATERAL INHIBITION [J].
COULTRIP, R ;
GRANGER, R ;
LYNCH, G .
NEURAL NETWORKS, 1992, 5 (01) :47-54
[7]  
De Sa VR, 1993, ADV NEURAL INFORM PR, P220
[8]  
DESA VR, 1993, COMPUTATION AND NEURAL SYSTEMS, P437
[9]   PATTERN-CLASSIFICATION BY THE BAYES MACHINE [J].
DIAMANTINI, C ;
SPALVIERI, A .
ELECTRONICS LETTERS, 1995, 31 (24) :2086-2088
[10]   Visual learning in the perception of texture: Simple and contingent aftereffects of texture density [J].
Durgin, FH ;
Proffitt, DR .
SPATIAL VISION, 1996, 9 (04) :423-474