Real-Time Human Pose Recognition in Parts from Single Depth Images

被引:1176
作者
Shotton, Jamie [1 ]
Sharp, Toby [1 ]
Kipman, Alex
Fitzgibbon, Andrew [1 ]
Finocchio, Mark
Blake, Andrew [1 ]
Cook, Mat [1 ]
Moore, Richard
机构
[1] Microsoft Res, Cambridge, England
关键词
All Open Access; Green;
D O I
10.1145/2398356.2398381
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new method to quickly and accurately - predict human pose-the 3D positions of body joints-from a single depth image, without depending on information from preceding frames. Our approach is strongly rooted in current object recognition strategies. By designing an intermediate - representation in terms of body parts, the difficult pose estimation problem is transformed into a simpler per-pixel classification problem, for which efficient machine learning techniques exist. By using computer graphics to synthesize a very large dataset of training image pairs, one can train a classifier that estimates body part labels from test images invariant to pose, body shape, clothing, and other irrelevances. Finally, we generate confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes. The system runs in under 5ms on the Xbox 360. Our evaluation shows high accuracy on both synthetic and real test sets, and investigates the effect of several training parameters. We achieve state-of-the-art accuracy in our comparison with related work and demonstrate improved generalization over exact whole-skeleton nearest neighbor matching.
引用
收藏
页码:116 / 124
页数:9
相关论文
共 27 条