Learning spatial relations in object recognition

被引:13
作者
Pham, Thang V. [1 ]
Smeulders, Arnold W. M. [1 ]
机构
[1] Univ Amsterdam, Inst Informat, ISIS, NL-1098 SJ Amsterdam, Netherlands
关键词
articulated object; Bayesian network; deformable model; part-based approach; shape; spatial relation;
D O I
10.1016/j.patrec.2006.03.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies two types of spatial relationships that can be learned from training examples for object recognition. The first one employs deformable relationships between object parts with a Gaussian model, while the second one describes pairwise relationships between pixel intensity values using Bayesian networks. We perform experiments on a human face dataset and a horse dataset, imposing the same amount of annotation of training data, which can be seen as sending knowledge to the learning algorithms. The result indicates that the Bayesian network method compares favorably to the deformable model, as it can capture long-distance stable relations in the object appearance. We also conclude that both methods are superior to strictly spatial matching by template and strictly non-spatial classifiers. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:1673 / 1684
页数:12
相关论文
共 26 条
[1]   Graph matching for visual object recognition [J].
Bunke, H .
SPATIAL VISION, 2000, 13 (2-3) :335-340
[2]   Recognition of planar object classes [J].
Burl, MC ;
Perona, P .
1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, :223-230
[3]   Face detection with information-based maximum discrimination [J].
Colmenarez, AJ ;
Huang, TS .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :782-787
[4]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[5]  
Deselaers T, 2005, PROC CVPR IEEE, P157
[6]   Query processing in spatial-query-by-sketch [J].
Egenhofer, MJ .
JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 1997, 8 (04) :403-424
[7]  
Felzenszwalb PF, 2000, PROC CVPR IEEE, P66, DOI 10.1109/CVPR.2000.854739
[8]  
Fergus R, 2003, PROC CVPR IEEE, P264
[9]   REPRESENTATION AND MATCHING OF PICTORIAL STRUCTURES [J].
FISCHLER, MA ;
ELSCHLAGER, RA .
IEEE TRANSACTIONS ON COMPUTERS, 1973, C 22 (01) :67-92
[10]  
Fussenegger M., 2004, PATTERN RECOGN, V3, P41