A comparative study of energy minimization methods for Markov random fields with smoothness-based priors

被引:646
作者
Szeliski, Richard [1 ]
Zabih, Ramin [2 ]
Scharstein, Daniel [3 ]
Veksler, Olga [4 ]
Kolmogorov, Vladimir [5 ]
Agarwala, Aseem [6 ]
Tappen, Marshall [7 ]
Rother, Carsten [8 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
[3] Middlebury Coll, Dept Comp Sci, Middlebury, VT 05753 USA
[4] Univ Western Ontario, Middlesex Coll 367, Dept Comp Sci, London, ON N6A 5B7, Canada
[5] UCL, London IP5 3RE, England
[6] Adobe Syst Inc, Seattle, WA 98103 USA
[7] Univ Cent Florida, Orlando, FL 32816 USA
[8] Microsoft Res Ltd, Cambridge CB3 0FB, England
关键词
performance evaluation; Markov random fields; global optimization; graph cuts; belief propagation;
D O I
10.1109/TPAMI.2007.70844
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Among the most exciting advances in early vision has been the development of efficient energy minimization algorithms for pixel-labeling tasks such as depth or texture computation. It has been known for decades that such problems can be elegantly expressed as Markov random fields, yet the resulting energy minimization problems have been widely viewed as intractable. Recently, algorithms such as graph cuts and loopy belief propagation (LBP) have proven to be very powerful: For example, such methods form the basis for almost all the top-performing stereo methods. However, the trade-offs among different energy minimization algorithms are still not well understood. In this paper, we describe a set of energy minimization benchmarks and use them to compare the solution quality and runtime of several common energy minimization algorithms. We investigate three promising recent methods-graph cuts, LBP, and tree-reweighted message passing - in addition to the well-known older iterated conditional mode (ICM) algorithm. Our benchmark problems are drawn from published energy functions used for stereo, image stitching, interactive segmentation, and denoising. We also provide a general-purpose software interface that allows vision researchers to easily switch between optimization methods. The benchmarks, code, images, and results are available at http://vision.middlebury.edu/MRF/.
引用
收藏
页码:1068 / 1080
页数:13
相关论文
共 50 条
[1]   Panoramic video textures [J].
Agarwala, A ;
Zheng, KC ;
Pal, C ;
Agrawala, M ;
Cohen, M ;
Curless, B ;
Salesin, D ;
Szeliski, R .
ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :821-827
[2]   Interactive digital photomontage [J].
Agarwala, A ;
Dontcheva, M ;
Agrawala, M ;
Drucker, S ;
Colburn, A ;
Curless, B ;
Salesin, D ;
Cohen, M .
ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03) :294-302
[3]   A survey of very large-scale neighborhood search techniques [J].
Ahuja, RK ;
Ergun, Ö ;
Orlin, JB ;
Punnen, AP .
DISCRETE APPLIED MATHEMATICS, 2002, 123 (1-3) :75-102
[4]  
[Anonymous], 1988, PROBABILISTIC REASON, DOI DOI 10.1016/C2009-0-27609-4
[5]  
BARNEBY RC, 1989, INTERMOUNTAIN FLOR B, V3, P1
[6]  
BESAG J, 1986, J R STAT SOC B, V48, P259
[7]   A pixel dissimilarity measure that is insensitive to image sampling [J].
Birchfield, S ;
Tomasi, C .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (04) :401-406
[8]  
Blake A., 1987, Visual Reconstruction
[9]   Markov random fields with efficient approximations [J].
Boykov, Y ;
Veksler, O ;
Zabih, R .
1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :648-655
[10]   Fast approximate energy minimization via graph cuts [J].
Boykov, Y ;
Veksler, O ;
Zabih, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) :1222-1239