Pose Estimation and Segmentation of People in 3D Movies

Cited by: 16

Authors:

Alahari, Karteek ^{[1]}

Seguin, Guillaume ^{[2]}

Sivic, Josef ^{[1]}

Laptev, Ivan ^{[1]}

Affiliations:

[1] Inria, Valbonne, France

[2] École Normale Supérieure, F-75231 Paris, France

Source:

2013 IEEE International Conference on Computer Vision (ICCV) | 2013

Keywords:

DOI:

10.1109/ICCV.2013.263

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification code:

140502 [Artificial Intelligence];

Abstract:

We seek to obtain a pixel-wise segmentation and pose estimation of multiple people in a stereoscopic video. This involves challenges such as dealing with unconstrained stereoscopic video, non-stationary cameras, and complex indoor and outdoor dynamic scenes. The contributions of our work are two-fold. First, we develop a segmentation model incorporating person detection, pose estimation, as well as colour, motion, and disparity cues. Our new model explicitly represents depth ordering and occlusion. Second, we introduce a stereoscopic dataset with frames extracted from the feature-length movies "StreetDance 3D" and "Pina". The dataset contains 2727 realistic stereo pairs and includes annotation of human poses, person bounding boxes, and pixel-wise segmentations for hundreds of people. The dataset is composed of indoor and outdoor scenes depicting multiple people with frequent occlusions. We demonstrate results on our new challenging dataset, as well as on the H2view dataset from (Sheasby et al., ACCV 2012).

Pages: 2112-2119

Number of pages: 8