Multidimensional morphable models: A framework for representing and matching object classes

被引:60
作者
Jones, MJ
Poggio, T
机构
[1] Digital Equipment Corp, Cambridge Res Lab, Cambridge, MA 02139 USA
[2] MIT, Artificial Intelligence Lab, Cambridge, MA 02139 USA
[3] MIT, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
object representations; image analysis; correspondence; object recognition;
D O I
10.1023/A:1008074226832
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a flexible model for representing images of objects of a certain class, known a priori, such as faces, and introduce a new algorithm for matching it to a novel image and thereby perform image analysis. The flexible model, known as a multidimensional morphable model, is learned from example images of objects of a class. In this paper we introduce an effective stochastic gradient descent algorithm that automatically matches a model to a novel image. Several experiments demonstrate the robustness and the broad range of applicability of morphable models. Our approach can provide novel solutions to several vision tasks, including the computation of image correspondence, object verification and image compression.
引用
收藏
页码:107 / 131
页数:25
相关论文
共 46 条
  • [1] ATICK J, 1995, NEURAL COMPUTATION
  • [2] BERGEN JR, 1990, HIERARCHICAL MOTION
  • [3] THREE-DIMENSIONAL OBJECT RECOGNITION.
    Besl, Paul J.
    Jain, Ramesh C.
    [J]. Computing surveys, 1985, 17 (01): : 75 - 145
  • [4] Image representations for visual learning
    Beymer, D
    Poggio, T
    [J]. SCIENCE, 1996, 272 (5270) : 1905 - 1909
  • [5] BEYMER D, 1995, THESIS MIT
  • [6] BEYMER D, 1995, 1536 AI MIT
  • [7] BEYMER D, 1995, 1537 AI MIT
  • [8] BEYMER D, 1993, 1431 AI MIT
  • [9] Blake A., 1994, Computer Graphics Proceedings. Annual Conference Series 1994. SIGGRAPH 94 Conference Proceedings, P185, DOI 10.1145/192161.192197
  • [10] HOW ARE 3-DIMENSIONAL OBJECTS REPRESENTED IN THE BRAIN
    BULTHOFF, HH
    EDELMAN, SY
    TARR, MJ
    [J]. CEREBRAL CORTEX, 1995, 5 (03) : 247 - 260