Harmony Potentials

被引:62
作者
Boix, Xavier [1 ,3 ]
Gonfaus, Josep M. [1 ,2 ]
van de Weijer, Joost [1 ,2 ]
Bagdanov, Andrew D. [1 ]
Serrat, Joan [1 ,2 ]
Gonzalez, Jordi [1 ,2 ]
机构
[1] Ctr Visio Comp, Barcelona, Spain
[2] Univ Autonoma Barcelona, Dept Comp Sci, E-08193 Barcelona, Spain
[3] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
关键词
Semantic object segmentation; Hierarchical conditional random fields; ENERGY MINIMIZATION; TEXTURE; CLASSIFICATION; SEGMENTATION; FEATURES; LAYOUT; SCALE;
D O I
10.1007/s11263-011-0449-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hierarchical Conditional Random Field (HCRF) model have been successfully applied to a number of image labeling problems, including image segmentation. However, existing HCRF models of image segmentation do not allow multiple classes to be assigned to a single region, which limits their ability to incorporate contextual information across multiple scales. At higher scales in the image, this representation yields an oversimplified model since multiple classes can be reasonably expected to appear within large regions. This simplified model particularly limits the impact of information at higher scales. Since class-label information at these scales is usually more reliable than at lower, noisier scales, neglecting this information is undesirable. To address these issues, we propose a new consistency potential for image labeling problems, which we call the harmony potential. It can encode any possible combination of labels, penalizing only unlikely combinations of classes. We also propose an effective sampling strategy over this expanded label set that renders tractable the underlying optimization problem. Our approach obtains state-of-the-art results on two challenging, standard benchmark datasets for semantic image segmentation: PASCAL VOC 2010, and MSRC-21.
引用
收藏
页码:83 / 102
页数:20
相关论文
共 82 条
  • [71] Evaluating Color Descriptors for Object and Scene Recognition
    van de Sande, Koen E. A.
    Gevers, Theo
    Snoek, Cees G. M.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1582 - 1596
  • [72] Learning Color Names for Real-World Applications
    van de Weijer, Joost
    Schmid, Cordelia
    Verbeek, Jakob
    Larlus, Diane
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (07) : 1512 - 1523
  • [73] Describing Reflectances for Color Segmentation Robust to Shadows, Highlights, and Textures
    Vazquez, Eduard
    Baldrich, Ramon
    van de Weijer, Joost
    Vanrell, Maria
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 917 - 930
  • [74] Vedaldi A., 2008, LECT NOTES COMPUT SC, DOI 10.1007/978-3-540-88693-8\\_52
  • [75] Verbeek J., 2008, Advances in Neural Information Processing Systems, P1553
  • [76] Graphical Models, Exponential Families, and Variational Inference
    Wainwright, Martin J.
    Jordan, Michael I.
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2008, 1 (1-2): : 1 - 305
  • [77] Winn J, 2005, IEEE I CONF COMP VIS, P1800
  • [78] Global Stereo Reconstruction under Second-Order Smoothness Priors
    Woodford, Oliver
    Torr, Philip
    Reid, Ian
    Fitzgibbon, Andrew
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (12) : 2115 - 2128
  • [79] Yang J., 2009, P COMP VIS PATT REC
  • [80] Yang Y., 2010, P COMP VIS PATT REC