Robust Higher Order Potentials for Enforcing Label Consistency

被引:497
作者
Kohli, Pushmeet [1 ]
Ladicky, L'ubor [2 ]
Torr, Philip H. S. [2 ]
机构
[1] Microsoft Res, Cambridge, England
[2] Oxford Brookes Univ, Oxford OX3 0BP, England
基金
英国工程与自然科学研究理事会;
关键词
Discrete energy minimization; Markov and conditional random fields; Object recognition and segmentation;
D O I
10.1007/s11263-008-0202-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel framework for labelling problems which is able to combine multiple segmentations in a principled manner. Our method is based on higher order conditional random fields and uses potentials defined on sets of pixels (image segments) generated using unsupervised segmentation algorithms. These potentials enforce label consistency in image regions and can be seen as a generalization of the commonly used pairwise contrast sensitive smoothness potentials. The higher order potential functions used in our framework take the form of the Robust P (n) model and are more general than the P (n) Potts model recently proposed by Kohli et al. We prove that the optimal swap and expansion moves for energy functions composed of these potentials can be computed by solving a st-mincut problem. This enables the use of powerful graph cut based move making algorithms for performing inference in the framework. We test our method on the problem of multi-class object segmentation by augmenting the conventional crf used for object segmentation with higher order potentials defined on image regions. Experiments on challenging data sets show that integration of higher order potentials quantitatively and qualitatively improves results leading to much better definition of object boundaries. We believe that this method can be used to yield similar improvements for many other labelling problems.
引用
收藏
页码:302 / 324
页数:23
相关论文
共 49 条
  • [31] Lempitsky Victor., 2007, ICCV
  • [32] Levin A, 2006, LECT NOTES COMPUT SC, V3954, P581
  • [33] Lovasz Laszlo, 1983, Mathematical Programming the State of the Art, P235
  • [34] Orlin JB, 2007, LECT NOTES COMPUT SC, V4513, P240
  • [35] Texture synthesis via a noncausal nonparametric multiscale Markov random field
    Paget, R
    Longstaff, ID
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (06) : 925 - 931
  • [36] Potetz B., 2007, IEEE C COMP VIS PATT
  • [37] Rabinovich Andrew., 2006, CVPR, V1, P1130
  • [38] Ren XF, 2003, NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, P10
  • [39] Roth S, 2005, PROC CVPR IEEE, P860
  • [40] GrabCut - Interactive foreground extraction using iterated graph cuts
    Rother, C
    Kolmogorov, V
    Blake, A
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03): : 309 - 314