Learning low-level vision

被引:1137
作者
Freeman, WT
Pasztor, EC
Carmichael, OT
机构
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
[2] MIT, Media Lab, Cambridge, MA 02139 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
vision and learning; belief propagation; low-level vision; super-resolution; shading and reflectance; motion estimation;
D O I
10.1023/A:1026501619075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a learning-based method for low-level vision problems-estimating scenes from images. We generate a synthetic world of scenes and their corresponding rendered images, modeling their relationships with a Markov network. Bayesian belief propagation allows us to efficiently find a local maximum of the posterior probability for the scene, given an image. We call this approach VISTA-Vision by Image/Scene TrAining. We apply VISTA to the "super-resolution" problem (estimating high frequency details from a low-resolution image), showing good results. To illustrate the potential breadth of the technique, we also apply it in two other problem domains, both simplified. We learn to distinguish shading from reflectance variations in a single image under particular lighting conditions. For the motion estimation problem in a "blobs world", we show figure/ground discrimination, solution of the aperture problem, and filling-in arising from application of the same probabilistic machinery.
引用
收藏
页码:25 / 47
页数:23
相关论文
共 50 条
  • [1] Adelson E.H., 1991, Computational Models of Visual Processing, V1, P3
  • [2] ADELSON EH, 1995, COMMUNICATION
  • [3] [Anonymous], P EUR C COMP VIS
  • [4] COMPUTATIONAL VISION
    BARROW, HG
    TENENBAUM, JM
    [J]. PROCEEDINGS OF THE IEEE, 1981, 69 (05) : 572 - 595
  • [5] The ''independent components'' of natural scenes are edge filters
    Bell, AJ
    Sejnowski, TJ
    [J]. VISION RESEARCH, 1997, 37 (23) : 3327 - 3338
  • [6] BERGER J. O., 2013, Statistical Decision Theory and Bayesian Analysis, DOI [10.1007/978-1-4757-4286-2, DOI 10.1007/978-1-4757-4286-2]
  • [7] BESAG J, 1974, J ROY STAT SOC B MET, V36, P192
  • [8] BINFORD T, 1988, UNCERTAINTY ARTIFICI
  • [9] Bishop C. M., 1995, NEURAL NETWORKS PATT
  • [10] THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE
    BURT, PJ
    ADELSON, EH
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) : 532 - 540