Progressive search space reduction for human pose estimation

被引：345

作者：

Ferrari, Vittorio ^{[1
]}

Marin-Jimenez, Manuel ^{[2
]}

Zisserman, Andrew ^{[1
]}

机构：

[1] Univ Oxford, Oxford OX1 2JD, England

[2] Univ Granada, E-18071 Granada, Spain

来源：

2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12 | 2008年

关键词：

D O I：

10.1109/CVPR.2008.4587468

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The objective of this paper is to estimate 2D human pose as a spatial configuration of body parts in TV and movie video shots. Such video material is uncontrolled and extremely challenging. We propose an approach that progressively reduces the search space for body parts, to greatly improve the chances that pose estimation will succeed. This involves two contributions: (i) a generic detector using a weak model of pose to substantially reduce the full pose search space; and (ii) employing 'grabcut' initialized on detected regions proposed by the weak model, to further prune the search space. Moreover, we also propose (iii) an integrated spatio-temporal model covering multiple frames to refine pose estimates from individual frames, with inference using belief propagation. The method is fully automatic and self-initializing, and explains the spatio-temporal volume covered by a person moving in a shot, by soft-labeling every pixel as belonging to a particular body part or to the background. We demonstrate upper-body pose estimation by an extensive evaluation over 70000 frames from four episodes of the TV series Buffy the vampire slayer, and present an application to full-body action recognition on the Weizmann dataset.

引用

页数：8

共 26 条

[1]

AGARWAL A, ECCV04

[2]

[Anonymous], 2007, ICCV WORKSH HUM MOT

[3]

[Anonymous], 2004, CVPR

[4]

[Anonymous], 2007, CVPR

[5]

[Anonymous], 2005, IJCV

[6]

[Anonymous], 2006, ECCV

[7]

[Anonymous], 2004, SIGGRAPH

[8]

Blank M., 2005, ICCV

[9]

Bray M., 2006, ECCV

[10] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

← 1 2 3 →