OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields

被引:3146
作者
Cao, Zhe [1 ]
Hidalgo, Gines [2 ]
Simon, Tomas [3 ]
Wei, Shih-En [3 ]
Sheikh, Yaser [2 ]
机构
[1] Univ Calif Berkeley, Berkeley Artificial Intelligence Res Lab BAIR, Berkeley, CA 94709 USA
[2] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[3] Facebook Real Labs, Pittsburgh, PA 15213 USA
关键词
2D human pose estimation; 2D foot keypoint estimation; real-time; multiple person; part affinity fields; PICTORIAL STRUCTURES; PEOPLE; MODELS;
D O I
10.1109/TPAMI.2019.2929257
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Realtime multi-person 2D pose estimation is a key component in enabling machines to have an understanding of people in images and videos. In this work, we present a realtime approach to detect the 2D pose of multiple people in an image. The proposed method uses a nonparametric representation, which we refer to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image. This bottom-up system achieves high accuracy and realtime performance, regardless of the number of people in the image. In previous work, PAFs and body part location estimation were refined simultaneously across training stages. We demonstrate that a PAF-only refinement rather than both PAF and body part location refinement results in a substantial increase in both runtime performance and accuracy. We also present the first combined body and foot keypoint detector, based on an internal annotated foot dataset that we have publicly released. We show that the combined detector not only reduces the inference time compared to running them sequentially, but also maintains the accuracy of each component individually. This work has culminated in the release of Open Pose, the first open-source realtime system for multi-person 2D pose detection, including body, foot, hand, and facial keypoints.
引用
收藏
页码:172 / 186
页数:15
相关论文
共 77 条
[1]
2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[2]
Monocular 3D Pose Estimation and Tracking by Detection [J].
Andriluka, Mykhaylo ;
Roth, Stefan ;
Schiele, Bernt .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :623-630
[3]
Andriluka M, 2009, PROC CVPR IEEE, P1014, DOI 10.1109/CVPRW.2009.5206754
[4]
[Anonymous], 1955, NAVAL RES LOGISTICS
[5]
[Anonymous], 2010, bmvc
[6]
[Anonymous], 2018, P EUROPEAN C COMPUTE
[7]
[Anonymous], 2016, MSCOCO KEYPOINT EVAL
[8]
[Anonymous], 2017, P 31 INT C NEUR INF
[9]
[Anonymous], 2016, MSCOCO KEYPOINT LEAD
[10]
[Anonymous], PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.90