PWP3D: Real-Time Segmentation and Tracking of 3D Objects

被引:192
作者
Prisacariu, Victor A. [1 ]
Reid, Ian D. [1 ]
机构
[1] Univ Oxford, Dept Engn Sci, Oxford OX1 3PJ, England
基金
英国工程与自然科学研究理事会;
关键词
Level set; Region based; Pose recovery; 3D tracking; Cuda; GPGPU; Real time; Segmentation; Tracking; IMAGE SEGMENTATION; MOTION;
D O I
10.1007/s11263-011-0514-3
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
We formulate a probabilistic framework for simultaneous region-based 2D segmentation and 2D to 3D pose tracking, using a known 3D model. Given such a model, we aim to maximise the discrimination between statistical foreground and background appearance models, via direct optimisation of the 3D pose parameters. The foreground region is delineated by the zero-level-set of a signed distance embedding function, and we define an energy over this region and its immediate background surroundings based on pixel-wise posterior membership probabilities (as opposed to likelihoods). We derive the differentials of this energy with respect to the pose parameters of the 3D object, meaning we can conduct a search for the correct pose using standard gradient-based non-linear minimisation techniques. We propose novel enhancements at the pixel level based on temporal consistency and improved online appearance model adaptation. Furthermore, straightforward extensions of our method lead to multi-camera and multi-object tracking as part of the same framework. The parallel nature of much of the processing in our algorithm means it is amenable to GPU acceleration, and we give details of our real-time implementation, which we use to generate experimental results on both real and artificial video sequences, with a number of 3D models. These experiments demonstrate the benefit of using pixel-wise posteriors rather than likelihoods, and showcase the qualities, such as robustness to occlusions and motion blur (and also some failure modes), of our tracker.
引用
收藏
页码:335 / 354
页数:20
相关论文
共 38 条
[1]
[Anonymous], 2000, Ph.D. thesis,
[2]
[Anonymous], 2009, NVIDIA CUDA Programming Guide
[3]
[Anonymous], 2004, DISTANCE TRANSFORMS
[4]
[Anonymous], 2007, Numerical Recipes
[5]
Bibby C, 2008, LECT NOTES COMPUT SC, V5303, P831, DOI 10.1007/978-3-540-88688-4_61
[6]
INFERRING SURFACES FROM IMAGES [J].
BINFORD, TO .
ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :205-244
[7]
Bouguet J-Y, 2008, Camera calibration toolbox for Matlab
[8]
Combined Region and Motion-Based 3D Tracking of Rigid and Articulated Objects [J].
Brox, Thomas ;
Rosenhahn, Bodo ;
Gall, Juergen ;
Cremers, Daniel .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (03) :402-415
[9]
A review of statistical approaches to level set segmentation: Integrating color, texture, motion and shape [J].
Cremers, Daniel ;
Rousson, Mikael ;
Deriche, Rachid .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 72 (02) :195-215
[10]
Kernel density estimation and intrinsic alignment for shape priors in level set segmentation [J].
Cremers, Daniel ;
Osher, Stanley J. ;
Soatto, Stefano .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2006, 69 (03) :335-351