Real-Time Continuous Pose Recovery of Human Hands Using Convolutional Networks

Cited by: 482
Authors
Tompson, Jonathan [1]
Stein, Murphy [1]
LeCun, Yann [1]
Perlin, Ken [1]
Affiliations
[1] NYU, New York, NY 10012 USA
Source
ACM TRANSACTIONS ON GRAPHICS | 2014 / Vol. 33 / No. 5
Keywords
Hand tracking; neural networks; markerless motion capture; analysis-by-synthesis
DOI
10.1145/2629500
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
We present a novel method for real-time continuous pose recovery of markerless complex articulable objects from a single depth image. Our method consists of the following stages: a randomized decision forest classifier for image segmentation, a robust method for labeled dataset generation, a convolutional network for dense feature extraction, and finally an inverse kinematics stage for stable real-time pose recovery. As one possible application of this pipeline, we show state-of-the-art results for real-time puppeteering of a skinned hand-model.
Pages: 10
Related papers
35 in total
  • [1] 3GEAR, 2014, 3GEAR SYST HAND TRAC
  • [2] The space of human body shapes: reconstruction and parameterization from range scans
    Allen, B
    Curless, B
    Popovic, Z
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03): : 587 - 594
  • [3] [Anonymous], 2013, ACM T GRAPH
  • [4] [Anonymous], 2011, TORCH7 MATLAB LIKE E
  • [5] [Anonymous], 2011, P BRIT MACH VIS C
  • [6] Ballan L, 2012, LECT NOTES COMPUT SC, V7577, P640, DOI 10.1007/978-3-642-33783-3_46
  • [7] Fast approximate energy minimization via graph cuts
    Boykov, Y
    Veksler, O
    Zabih, R
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) : 1222 - 1239
  • [8] Butler D.A., 2012, Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI '12, (New York, NY, USA), P1933, DOI DOI 10.1145/2208276.2208335
  • [9] Couprie C., 2013, P INT C LEARN REPR
  • [10] Vision-based hand pose estimation: A review
    Erol, Ali
    Bebis, George
    Nicolescu, Mircea
    Boyle, Richard D.
    Twombly, Xander
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2007, 108 (1-2) : 52 - 73