Vision as Bayesian inference: analysis by synthesis?

Cited by: 489
Authors
Yuille, Alan [1]
Kersten, Daniel [2]
Affiliations
[1] Univ Calif Los Angeles, Dept Stat, San Francisco, CA 94115 USA
[2] Univ Minnesota, Dept Psychol, Minneapolis, MN 55455 USA
Keywords
DOI
10.1016/j.tics.2006.05.002
Chinese Library Classification number
B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology];
Discipline classification code
03 ; 0303 ; 030303 ; 04 ; 0402 ;
Abstract
We argue that the study of human vision should be aimed at determining how humans perform natural tasks with natural images. Attempts to understand the phenomenology of vision from artificial stimuli, although worthwhile as a starting point, can lead to faulty generalizations about visual systems, because of the enormous complexity of natural images. Dealing with this complexity is daunting, but Bayesian inference on structured probability distributions offers the ability to design theories of vision that can deal with the complexity of natural images, and that use 'analysis by synthesis' strategies with intriguing similarities to the brain. We examine these strategies using recent examples from computer vision, and outline some important implications for cognitive science.
Pages: 301-308
Page count: 8
Related papers
49 in total
[1]   The reverse hierarchy theory of visual perceptual learning [J].
Ahissar, M ;
Hochstein, S .
TRENDS IN COGNITIVE SCIENCES, 2004, 8 (10) :457-464
[2]  
[Anonymous], 1996, HIGH LEVEL VISION OB
[3]  
[Anonymous], 1982, VISION COMPUTATIONAL
[4]   Top-down facilitation of visual recognition [J].
Bar, M ;
Kassam, KS ;
Ghuman, AS ;
Boshyan, J ;
Schmidt, AM ;
Dale, AM ;
Hämäläinen, MS ;
Marinkovic, K ;
Schacter, DL ;
Rosen, BR ;
Halgren, E .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (02) :449-454
[5]   Do we know what the early visual system does? [J].
Carandini, M ;
Demb, JB ;
Mante, V ;
Tolhurst, DJ ;
Dan, Y ;
Olshausen, BA ;
Gallant, JL ;
Rust, NC .
JOURNAL OF NEUROSCIENCE, 2005, 25 (46) :10577-10597
[6]   Probabilistic models of language processing and acquisition [J].
Chater, Nick ;
Manning, Christopher D. .
TRENDS IN COGNITIVE SCIENCES, 2006, 10 (07) :335-344
[7]  
Chen XR, 2004, PROC CVPR IEEE, P366
[8]  
CLARK JJ, 1990, KLUWER INT SERIES EN, V105
[9]   Efficient deformable template detection and localization without user initialization [J].
Coughlan, J ;
Yuille, A ;
English, C ;
Snow, D .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2000, 78 (03) :303-319
[10]   THE HELMHOLTZ MACHINE [J].
DAYAN, P ;
HINTON, GE ;
NEAL, RM ;
ZEMEL, RS .
NEURAL COMPUTATION, 1995, 7 (05) :889-904