Learning low-level vision

被引：1137

作者：

Freeman, WT

Pasztor, EC

Carmichael, OT

机构：

[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA

[2] MIT, Media Lab, Cambridge, MA 02139 USA

[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2000年 / 40卷 / 01期

关键词：

vision and learning; belief propagation; low-level vision; super-resolution; shading and reflectance; motion estimation;

D O I：

10.1023/A:1026501619075

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a learning-based method for low-level vision problems-estimating scenes from images. We generate a synthetic world of scenes and their corresponding rendered images, modeling their relationships with a Markov network. Bayesian belief propagation allows us to efficiently find a local maximum of the posterior probability for the scene, given an image. We call this approach VISTA-Vision by Image/Scene TrAining. We apply VISTA to the "super-resolution" problem (estimating high frequency details from a low-resolution image), showing good results. To illustrate the potential breadth of the technique, we also apply it in two other problem domains, both simplified. We learn to distinguish shading from reflectance variations in a single image under particular lighting conditions. For the motion estimation problem in a "blobs world", we show figure/ground discrimination, solution of the aperture problem, and filling-in arising from application of the same probabilistic machinery.

引用

页码：25 / 47

页数：23

共 50 条

[1] Adelson E.H., 1991, Computational Models of Visual Processing, V1, P3
[2] ADELSON EH, 1995, COMMUNICATION
[3] [Anonymous], P EUR C COMP VIS
[4] COMPUTATIONAL VISION
BARROW, HG
TENENBAUM, JM
[J]. PROCEEDINGS OF THE IEEE, 1981, 69 (05) : 572 - 595
[5] The ''independent components'' of natural scenes are edge filters
Bell, AJ
Sejnowski, TJ
[J]. VISION RESEARCH, 1997, 37 (23) : 3327 - 3338
[6] BERGER J. O., 2013, Statistical Decision Theory and Bayesian Analysis, DOI [10.1007/978-1-4757-4286-2, DOI 10.1007/978-1-4757-4286-2]
[7] BESAG J, 1974, J ROY STAT SOC B MET, V36, P192
[8] BINFORD T, 1988, UNCERTAINTY ARTIFICI
[9] Bishop C. M., 1995, NEURAL NETWORKS PATT
[10] THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE
BURT, PJ
ADELSON, EH
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) : 532 - 540

← 1 2 3 4 5 →