Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories

被引：1868

作者：

Li Fei-Fei

Fergus, Rob

Perona, Pietro

机构：

[1] Princeton Univ, Princeton, NJ 08540 USA

[2] Univ Oxford, Oxford OX1 3PJ, England

[3] CALTECH, Pasadena, CA 91125 USA

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2007年 / 106卷 / 01期

关键词：

object recognition; categorization; generative model; incremental learning; Bayesian model;

D O I：

10.1016/j.cviu.2005.09.012

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current computational approaches to learning visual object categories require thousands of training images, are slow, cannot learn in an incremental manner and cannot incorporate prior information into the learning process. In addition, no algorithm presented in the literature has been tested on more than a handful of object categories. We present all method for learning object categories from just a few training images. It is quick and it uses prior information in a principled way. We test it on a dataset composed of images of objects belonging to 101 widely varied categories. Our proposed method is based on making use of prior information, assembled from (unrelated) object categories which were previously learnt. A generative probabilistic model is used, which represents the shape and appearance of a constellation of features belonging to the object. The parameters of the model are learnt incrementally in a Bayesian manner. Our incremental algorithm is compared experimentally to an earlier batch Bayesian algorithm, as well as to one based on maximum likelihood. The incremental and batch versions have comparable classification performance on small training sets, but incremental learning is significantly faster, making real-time learning feasible. Both Bayesian methods outperform maximum likelihood on small training sets. (C) 2006 Elsevier Inc. All rights reserved.

引用

页码：59 / 70

页数：12

共 18 条

[1] A computational model for visual selection
Amit, Y
Geman, D
[J]. NEURAL COMPUTATION, 1999, 11 (07) : 1691 - 1715
[2] Attias H, 1999, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, P21
[3] RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING
BIEDERMAN, I
[J]. PSYCHOLOGICAL REVIEW, 1987, 94 (02) : 115 - 147
[4] BURL MC, P EUR C COMP VIS, P628
[5] A Bayesian approach to unsupervised one-shot learning of object categories
Fei-Fei, L
Fergus, R
Perona, P
[J]. NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, : 1134 - 1141
[6] Fergus R, 2003, PROC CVPR IEEE, P264
[7] FIOLA P, 2001, P C COMP VIS PATT RE, V1, P511
[8] An introduction to variational methods for graphical models
Jordan, MI
Ghahramani, Z
Jaakkola, TS
Saul, LK
[J]. MACHINE LEARNING, 1999, 37 (02) : 183 - 233
[9] Saliency, scale and image description
Kadir, T
Brady, M
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 45 (02) : 83 - 105
[10] Lowe D. G., 1999, P IEEE INT C COMP VI, V2, P1150, DOI DOI 10.1109/ICCV.1999.790410

← 1 2 →