Fast approximate energy minimization via graph cuts

被引:4892
作者
Boykov, Y
Veksler, O
Zabih, R
机构
[1] Siemens Corp, Res, Princeton, NJ 08540 USA
[2] NEC Res Inst, Princeton, NJ 08540 USA
[3] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
基金
美国国家科学基金会;
关键词
energy minimization; early vision; graph algorithms; minimum cut; maximum flow; stereo; motion; image restoration; Markov Random Fields; Potts model; multiway cut;
D O I
10.1109/34.969114
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many tasks in computer vision involve assigning a label (such as disparity) to every pixel. A common constraint is that the labels should vary smoothly almost everywhere while preserving sharp discontinuities that may exist, e.g., at object boundaries. These tasks are naturally stated in terms of energy minimization. In this paper, we consider a wide class of energies with various smoothness constraints. Global minimization of these energy functions is NP-hard even in the simplest discontinuity-preserving case. Therefore, our focus is on efficient approximation algorithms. We present two algorithms based on graph cuts that efficiently find a local minimum with respect to two types of large moves, namely expansion moves and swap moves. These moves can simultaneously change the labels of arbitrarily large sets of pixels. In contrast, many standard algorithms (including simulated annealing) use small moves where only one pixel changes its label at a time. Our expansion algorithm finds a labeling within a known factor of the global minimum, while our swap algorithm handles more general energy functions. Both of these algorithms allow important cases of discontinuity preserving energies. We experimentally demonstrate the effectiveness of our approach for image restoration, stereo and motion. On real data with ground truth, we achieve 98 percent accuracy.
引用
收藏
页码:1222 / 1239
页数:18
相关论文
共 46 条
  • [1] Ahuja RK, 1993, NETWORK FLOWS THEORY
  • [2] USING DYNAMIC-PROGRAMMING FOR SOLVING VARIATIONAL-PROBLEMS IN VISION
    AMINI, AA
    WEYMOUTH, TE
    JAIN, RC
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (09) : 855 - 867
  • [3] [Anonymous], 1999, THESIS CORNELL U
  • [4] BARKER S, 1998, P IEEE INT C AC SPEE, V5, P2757
  • [5] BESAG J, 1986, J R STAT SOC B, V48, P259
  • [6] A pixel dissimilarity measure that is insensitive to image sampling
    Birchfield, S
    Tomasi, C
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (04) : 401 - 406
  • [7] BIRCHFIELD S, 1999, INT J COMPUT VISION, V35, P1
  • [8] BIRCHFIELD S, 1999, THESIS STANFORD U
  • [9] Blake V., 1989, COLLECTION MANAGEMEN, V11, P1
  • [10] Markov random fields with efficient approximations
    Boykov, Y
    Veksler, O
    Zabih, R
    [J]. 1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, : 648 - 655