Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories

被引:1868
作者
Li Fei-Fei
Fergus, Rob
Perona, Pietro
机构
[1] Princeton Univ, Princeton, NJ 08540 USA
[2] Univ Oxford, Oxford OX1 3PJ, England
[3] CALTECH, Pasadena, CA 91125 USA
关键词
object recognition; categorization; generative model; incremental learning; Bayesian model;
D O I
10.1016/j.cviu.2005.09.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current computational approaches to learning visual object categories require thousands of training images, are slow, cannot learn in an incremental manner and cannot incorporate prior information into the learning process. In addition, no algorithm presented in the literature has been tested on more than a handful of object categories. We present all method for learning object categories from just a few training images. It is quick and it uses prior information in a principled way. We test it on a dataset composed of images of objects belonging to 101 widely varied categories. Our proposed method is based on making use of prior information, assembled from (unrelated) object categories which were previously learnt. A generative probabilistic model is used, which represents the shape and appearance of a constellation of features belonging to the object. The parameters of the model are learnt incrementally in a Bayesian manner. Our incremental algorithm is compared experimentally to an earlier batch Bayesian algorithm, as well as to one based on maximum likelihood. The incremental and batch versions have comparable classification performance on small training sets, but incremental learning is significantly faster, making real-time learning feasible. Both Bayesian methods outperform maximum likelihood on small training sets. (C) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:59 / 70
页数:12
相关论文
共 18 条
  • [1] A computational model for visual selection
    Amit, Y
    Geman, D
    [J]. NEURAL COMPUTATION, 1999, 11 (07) : 1691 - 1715
  • [2] Attias H, 1999, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, P21
  • [3] RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING
    BIEDERMAN, I
    [J]. PSYCHOLOGICAL REVIEW, 1987, 94 (02) : 115 - 147
  • [4] BURL MC, P EUR C COMP VIS, P628
  • [5] A Bayesian approach to unsupervised one-shot learning of object categories
    Fei-Fei, L
    Fergus, R
    Perona, P
    [J]. NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, : 1134 - 1141
  • [6] Fergus R, 2003, PROC CVPR IEEE, P264
  • [7] FIOLA P, 2001, P C COMP VIS PATT RE, V1, P511
  • [8] An introduction to variational methods for graphical models
    Jordan, MI
    Ghahramani, Z
    Jaakkola, TS
    Saul, LK
    [J]. MACHINE LEARNING, 1999, 37 (02) : 183 - 233
  • [9] Saliency, scale and image description
    Kadir, T
    Brady, M
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 45 (02) : 83 - 105
  • [10] Lowe D. G., 1999, P IEEE INT C COMP VI, V2, P1150, DOI DOI 10.1109/ICCV.1999.790410