Human Pose Estimation with Iterative Error Feedback

被引:480
作者
Carreira, Joao [1 ,2 ]
Agrawal, Pulkit [1 ]
Fragkiadaki, Katerina [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Google DeepMind, London, England
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
关键词
D O I
10.1109/CVPR.2016.512
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Hierarchical feature extractors such as Convolutional Networks (ConvNets) have achieved impressive performance on a variety of classification tasks using purely feedforward processing. Feedforward architectures can learn rich representations of the input space but do not explicitly model dependencies in the output spaces, that are quite structured for tasks such as articulated human pose estimation or object segmentation. Here we propose a framework that expands the expressive power of hierarchical feature extractors to encompass both input and output spaces, by introducing top-down feedback. Instead of directly predicting the outputs in one go, we use a self-correcting model that progressively changes an initial solution by feeding back error predictions, in a process we call Iterative Error Feedback (IEF). IEF shows excellent performance on the task of articulated pose estimation in the challenging MPII and LSP benchmarks, matching the state-of-the-art without requiring ground truth scale annotation.
引用
收藏
页码:4733 / 4742
页数:10
相关论文
共 48 条
[1]
2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[2]
[Anonymous], CVPR
[3]
[Anonymous], ICLR 2015
[4]
[Anonymous], COMBINING LOCAL APPE
[5]
[Anonymous], 2010, BMVC
[6]
[Anonymous], 2014, ARXIV14075104
[7]
[Anonymous], ARXIV14072538
[8]
[Anonymous], 2015, Efficient object localization using convolutional networks
[9]
[Anonymous], IDIAPRR5620070
[10]
[Anonymous], 2014, P BRIT MACH VIS C 20