Discriminative Learning with Latent Variables for Cluttered Indoor Scene Understanding

被引:4
作者
Wang, Huayan [1 ]
Gould, Stephen [2 ]
Koller, Daphne [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Australian Natl Univ, Sch Comp Sci, Canberra, ACT 0200, Australia
基金
美国国家科学基金会;
关键词
LAYOUT;
D O I
10.1145/2436256.2436276
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We address the problem of understanding an indoor scene from a single image in terms of recovering the room geometry (floor, ceiling, and walls) and furniture layout. A major challenge of this task arises from the fact that most indoor scenes are cluttered by furniture and decorations, whose appearances vary drastically across scenes, thus can hardly be modeled (or even hand-labeled) consistently. In this paper we tackle this problem by introducing latent variables to account for clutter, so that the observed image is jointly explained by the room and clutter layout. Model parameters are learned from a training set of images that are only labeled with the layout of the room geometry. Our approach enables taking into account and inferring indoor clutter without hand-labeling of the clutter in the training set, which is often inaccurate. Yet it outperforms the state-of-the-art method of Hedau et al.(7) that requires clutter labels. As a latent variable based method, our approach has an interesting feature that latent variables are used in direct correspondence with a concrete visual concept (clutter in the room) and thus interpretable.
引用
收藏
页码:92 / 99
页数:8
相关论文
共 20 条
  • [1] [Anonymous], IEEE C COMP VIS PATT
  • [2] [Anonymous], 2010, NIPS
  • [3] [Anonymous], 2009, P INT C COMP VIS ICC
  • [4] [Anonymous], 2009, ICCV
  • [5] BESAG J, 1986, J R STAT SOC B, V48, P259
  • [6] Mean shift: A robust approach toward feature space analysis
    Comaniciu, D
    Meer, P
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) : 603 - 619
  • [7] Object Detection with Discriminatively Trained Part-Based Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    Ramanan, Deva
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645
  • [8] Geiger A., 2011, Advances in Neural Information Processing Systems, V24, P1467
  • [9] Gupta A., 2011, COMP VIS PATT REC CV
  • [10] Hedau V, 2010, LECT NOTES COMPUT SC, V6316, P224, DOI 10.1007/978-3-642-15567-3_17