A closed-form solution to non-rigid shape and motion recovery

被引:114
作者
Xiao, Jing
Chai, Jinxiang
Kanade, Takeo
机构
[1] Epson Palo Alto Lab, Palo Alto, CA 94304 USA
[2] CMU, Inst Robot, Pittsburgh, PA 15213 USA
关键词
non-rigid structure from motion; shape bases; rotation constraint; ambiguity; basis constraints; closed-form solution;
D O I
10.1007/s11263-005-3962-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recovery of three dimensional (313) shape and motion of non-static scenes from a monocular video sequence is important for applications like robot navigation and human computer interaction. If every point in the scene randomly moves, it is impossible to recover the non-rigid shapes. In practice, many non-rigid objects, e.g. the human face under various expressions, deform with certain structures. Their shapes can be regarded as a weighted combination of certain shape bases. Shape and motion recovery under such situations has attracted much interest. Previous work on this problem (Bregler, C., Hertzmann, A., and Biermann, H. 2000. In Proc. Int. Conf. Computer Vision and Pattern Recognition; Brand, M. 2001. In Proc. Int. Conf. Computer Vision and Pattern Recognition; Torresani, L., Yang, D., Alexander, G., and Bregler, C. 2001. In Proc. Int. Conf. Computer Vision and Pattern Recognition) utilized only orthonormality constraints on the camera rotations (rotation constraints). This paper proves that using only the rotation constraints results in ambiguous and invalid solutions. The ambiguity arises from the fact that the shape bases are not unique. An arbitrary linear transformation of the bases produces another set of eligible bases. To eliminate the ambiguity, we propose a set of novel constraints, basis constraints, which uniquely determine the shape bases. We prove that, under the weak-perspective projection model, enforcing both the basis and the rotation constraints leads to a closed-form solution to the problem of non-rigid shape and motion recovery. The accuracy and robustness of our closed-form solution is evaluated quantitatively on synthetic data and qualitatively on real video sequences.
引用
收藏
页码:233 / 246
页数:14
相关论文
共 24 条
[1]  
BAKER S, 2001, P INT C COMP VIS PAT
[2]   Separability of pose and expression in facial tracking and animation [J].
Bascle, B ;
Blake, A .
SIXTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, 1998, :323-328
[3]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[4]  
BRAND M, 2001, P INT C COMP VIS PAT
[5]  
BREGLER C, 2000, P INT C COMP VIS PAT
[6]  
CHAI J, 2003, EUR ACM S COMP AN
[7]   A multibody factorization method for independently moving objects [J].
Costeira, JP ;
Kanade, T .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 29 (03) :159-179
[8]  
GOKTURK SB, 2001, P INT C COMP VIS
[9]  
HAN M, 2000, P INT C COMP VIS PAT
[10]  
HARTLEY RI, 2000, MULITPLE VIEW GEOMET