Twin Gaussian Processes for Structured Prediction

Cited by: 136
Authors
Bo, Liefeng [2]
Sminchisescu, Cristian [1]
Affiliations
[1] Univ Bonn, INS, D-53115 Bonn, Germany
[2] TTI Chicago, Chicago, IL 60637 USA
Funding
National Science Foundation (USA)
Keywords
Structured prediction; Gaussian processes; 3D human pose reconstruction; Feature extraction; Video processing
DOI
10.1007/s11263-008-0204-y
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification code
140502 [Artificial intelligence]
Abstract
We describe twin Gaussian processes (TGP), a generic structured prediction method that uses Gaussian process (GP) priors on both covariates and responses, both multivariate. TGP estimates outputs by minimizing the Kullback-Leibler divergence between two GPs modeled as normal distributions over finite index sets of training and testing examples. This emphasizes the goal that similar inputs should produce similar percepts, and that this should hold, on average, between their marginal distributions. TGP captures not only the interdependencies between covariates, as in a typical GP, but also those between responses, so correlations among both inputs and outputs are accounted for. TGP is exemplified, with promising results, for the reconstruction of 3D human poses from monocular and multicamera video sequences in the recently introduced HumanEva benchmark, where we achieve 5 cm error on average per 3D marker for models trained jointly, using data from multiple people and multiple activities. The method is fast and automatic: it requires no hand-crafting of the initial pose, no camera calibration parameters, and no 3D body model associated with the human subjects used for training or testing.
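The core idea in the abstract, choosing the output that minimizes a Kullback-Leibler divergence between two Gaussians defined over the training and test examples, can be illustrated with a small sketch. This is not the authors' implementation. It assumes zero-mean Gaussians with RBF kernel matrices, a particular KL direction (input Gaussian relative to output Gaussian), a jitter term for numerical stability, and a brute-force search over a hypothetical list of candidate outputs instead of a proper optimizer. All function names and parameter values are illustrative.

```python
import numpy as np


def rbf_kernel(A, B, gamma):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq_dists = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-gamma * sq_dists)


def gaussian_kl(K0, K1):
    """KL( N(0, K0) || N(0, K1) ) for two zero-mean Gaussians of equal dimension."""
    n = K0.shape[0]
    _, logdet0 = np.linalg.slogdet(K0)
    _, logdet1 = np.linalg.slogdet(K1)
    trace_term = np.trace(np.linalg.solve(K1, K0))  # tr(K1^{-1} K0)
    return 0.5 * (trace_term - n + logdet1 - logdet0)


def predict_by_kl_search(X, Y, x_test, candidates,
                         gamma_x=1.0, gamma_y=1.0, jitter=1e-3):
    """Pick the candidate output whose joint output kernel matrix is closest,
    in KL divergence, to the joint input kernel matrix.

    Assumptions (not taken from the source): RBF kernels, the KL direction
    used below, and exhaustive search over `candidates`.
    """
    n = X.shape[0] + 1
    # Input Gaussian over the training inputs plus the test input.
    X_all = np.vstack([X, x_test[None, :]])
    K_in = rbf_kernel(X_all, X_all, gamma_x) + jitter * np.eye(n)

    best_y, best_kl = None, np.inf
    for y in candidates:
        # Output Gaussian over the training outputs plus the candidate output.
        Y_all = np.vstack([Y, y[None, :]])
        K_out = rbf_kernel(Y_all, Y_all, gamma_y) + jitter * np.eye(n)
        kl = gaussian_kl(K_in, K_out)
        if kl < best_kl:
            best_y, best_kl = y, kl
    return best_y, best_kl


if __name__ == "__main__":
    # Toy 1D data; the linear relation is purely illustrative.
    X = np.linspace(0.0, 1.0, 8)[:, None]
    Y = 2.0 * X
    candidates = np.linspace(0.0, 2.0, 41)[:, None]
    y_hat, kl = predict_by_kl_search(X, Y, np.array([0.5]), candidates)
    print("selected output:", y_hat, "KL:", kl)
```

The exhaustive search is only practical for low-dimensional outputs. For outputs such as 3D poses, a gradient-based optimizer over the test output would be the natural replacement.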
Pages: 28-52
Page count: 25