Monocular 3D Pose Estimation and Tracking by Detection

被引:254
作者
Andriluka, Mykhaylo [1 ]
Roth, Stefan [1 ]
Schiele, Bernt [1 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
来源
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010年
关键词
RECOGNITION;
D O I
10.1109/CVPR.2010.5540156
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Automatic recovery of 3D human pose from monocular image sequences is a challenging and important research topic with numerous applications. Although current methods are able to recover 3D pose for a single person in controlled environments, they are severely challenged by real-world scenarios, such as crowded street scenes. To address this problem, we propose a three-stage process building on a number of recent advances. The first stage obtains an initial estimate of the 2D articulation and viewpoint of the person from single frames. The second stage allows early data association across frames based on tracking-by-detection. These two stages successfully accumulate the available 2D image evidence into robust estimates of 2D limb positions over short image sequences (= tracklets). The third and final stage uses those tracklet-based estimates as robust image observations to reliably recover 3D pose. We demonstrate state-of-the-art performance on the HumanEva II benchmark, and also show the applicability of our approach to articulated 3D tracking in realistic street conditions.
引用
收藏
页码:623 / 630
页数:8
相关论文
共 31 条
[1]
Recovering 3D human pose from monocular images [J].
Agarwal, A ;
Triggs, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) :44-58
[2]
Andriluka M., CVPR 08
[3]
Andriluka M., CVPR 09
[4]
[Anonymous], 2006, HUMANEVA SYNCHRONIZ
[5]
BALAN AO, CVPR 07
[6]
BELONGIE S, NIPS 00
[7]
Articulated body motion capture by stochastic search [J].
Deutscher, J ;
Reid, I .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 61 (02) :185-205
[8]
Eichner M., BMVC 09
[9]
FELZENSSZWALB PF, CVPR 08
[10]
Pictorial structures for object recognition [J].
Felzenszwalb, PF ;
Huttenlocher, DP .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 61 (01) :55-79