Parsing human skeletons in an operating room

被引:38
作者
Belagiannis, Vasileios [1 ,2 ]
Wang, Xinchao [3 ]
Ben Shitrit, Horesh Beny [3 ]
Hashimoto, Kiyoshi [4 ]
Stauder, Ralf [1 ]
Aoki, Yoshimitsu [4 ]
Kranzfelder, Michael [5 ]
Schneider, Armin [5 ]
Fua, Pascal [3 ]
Ilic, Slobodan [1 ,6 ]
Feussner, Hubertus [5 ]
Navab, Nassir [1 ,7 ]
机构
[1] Tech Univ Munich, Comp Aided Med Procedures, Munich, Germany
[2] Univ Oxford, VGG, Oxford, England
[3] Ecole Polytech Fed Lausanne, CVLAB, Lausanne, Switzerland
[4] Keio Univ, Aoki Media Sensing Lab, Tokyo, Japan
[5] Tech Univ Munich, Klinikum Rechts Isar, MITI, Munich, Germany
[6] Siemens AG, Munich, Germany
[7] Johns Hopkins Univ, Baltimore, MD USA
基金
瑞士国家科学基金会;
关键词
Human pose estimation; Part-based model; Medical workflow analysis; POSE ESTIMATION; PICTORIAL STRUCTURES; MOTION CAPTURE; REGRESSION; TRACKING; PEOPLE;
D O I
10.1007/s00138-016-0792-4
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Multiple human pose estimation is an important yet challenging problem. In an operating room (OR) environment, the 3D body poses of surgeons and medical staff can provide important clues for surgical workflow analysis. For that purpose, we propose an algorithm for localizing and recovering body poses of multiple human in an OR environment under a multi-camera setup. Our model builds on 3D Pictorial Structures and 2D body part localization across all camera views, using convolutional neural networks (ConvNets). To evaluate our algorithm, we introduce a dataset captured in a real OR environment. Our dataset is unique, challenging and publicly available with annotated ground truths. Our proposed algorithm yields to promising pose estimation results on this dataset.
引用
收藏
页码:1035 / 1046
页数:12
相关论文
共 60 条
[1]
Recovering 3D human pose from monocular images [J].
Agarwal, A ;
Triggs, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) :44-58
[2]
Pose Estimation and Segmentation of People in 3D Movies [J].
Alahari, Karteek ;
Seguin, Guillaume ;
Sivic, Josef ;
Laptev, Ivan .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :2112-2119
[3]
Andriluka M., 2008, COMPUTERVISION PATTE, P1
[4]
Monocular 3D Pose Estimation and Tracking by Detection [J].
Andriluka, Mykhaylo ;
Roth, Stefan ;
Schiele, Bernt .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :623-630
[5]
Andriluka M, 2009, PROC CVPR IEEE, P1014, DOI 10.1109/CVPRW.2009.5206754
[6]
[Anonymous], 2012, 2012 IEEE INT C ROBO
[7]
[Anonymous], 2014, P AS C COMP VIS
[8]
[Anonymous], AS C COMP VIS ACCV 2
[9]
[Anonymous], 2015, THESIS
[10]
[Anonymous], COMP VIS PATT REC 20