Pictorial structures for object recognition

被引:1380
作者
Felzenszwalb, PF [1 ]
Huttenlocher, DP
机构
[1] MIT, Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
关键词
part-based object recognition; statistical models; energy minimization;
D O I
10.1023/B:VISI.0000042934.15159.49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a computationally efficient framework for part-based modeling and recognition of objects. Our work is motivated by the pictorial structure models introduced by Fischler and Elschlager. The basic idea is to represent an object by a collection of parts arranged in a deformable configuration. The appearance of each part is modeled separately, and the deformable configuration is represented by spring-like connections between pairs of parts. These models allow for qualitative descriptions of visual appearance, and are suitable for generic recognition problems. We address the problem of using pictorial structure models to find instances of an object in an image as well as the problem of learning an object model from training examples, presenting efficient algorithms in both cases. We demonstrate the techniques by learning models that represent faces and human bodies and using the resulting models to locate the corresponding objects in novel images.
引用
收藏
页码:55 / 79
页数:25
相关论文
共 41 条