An Interactive Approach to Semantic Modeling of Indoor Scenes with an RGBD Camera

被引:150
作者
Shao, Tianjia [2 ]
Xu, Weiwei [1 ]
Zhou, Kun [3 ]
Wang, Jingdong
Li, Dongping [3 ]
Guo, Baining
机构
[1] Hangzhou Normal Univ, Hangzhou, Zhejiang, Peoples R China
[2] Tsinghua Univ, Beijing, Peoples R China
[3] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou, Zhejiang, Peoples R China
来源
ACM TRANSACTIONS ON GRAPHICS | 2012年 / 31卷 / 06期
关键词
Indoor scene; Depth images; Segmentation; Labeling; Random Regress Forest; OBJECT RECOGNITION; 3D;
D O I
10.1145/2366145.2366155
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present an interactive approach to semantic modeling of indoor scenes with a consumer-level RGBD camera. Using our approach, the user first takes an RGBD image of an indoor scene, which is automatically segmented into a set of regions with semantic labels. If the segmentation is not satisfactory, the user can draw some strokes to guide the algorithm to achieve better results. After the segmentation is finished, the depth data of each semantic region is used to retrieve a matching 3D model from a database. Each model is then transformed according to the image depth to yield the scene. For large scenes where a single image can only cover one part of the scene, the user can take multiple images to construct other parts of the scene. The 3D models built for all images are then transformed and unified into a complete scene. We demonstrate the efficiency and robustness of our approach by modeling several real-world scenes.
引用
收藏
页数:11
相关论文
共 39 条
[1]  
Anand A., 2011, ABS11115358 CORR
[2]  
[Anonymous], 2009, ICCV
[3]  
[Anonymous], 2011, Advances in neural information processing systems
[4]  
Blum Manuel, 2011, NIPS WORKSH DEEP LEA
[5]  
Bo LF, 2011, IEEE INT C INT ROBOT, P821, DOI 10.1109/IROS.2011.6048717
[6]   Fast approximate energy minimization via graph cuts [J].
Boykov, Y ;
Veksler, O ;
Zabih, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) :1222-1239
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Shape Google: Geometric Words and Expressions for Invariant Shape Retrieval [J].
Bronstein, Alexander M. ;
Bronstein, Michael M. ;
Guibas, Leonidas J. ;
Ovsjanikov, Maks .
ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (01)
[9]  
Clark J., 2011, Computer, V44, P81, DOI 10.1109/MC.2011.320
[10]   An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization [J].
Dietterich, TG .
MACHINE LEARNING, 2000, 40 (02) :139-157