Pattern recognition with a Bayesian kernel combination machine

被引:22
作者
Damoulas, Theodoros [1 ]
Girolami, Mark A. [1 ]
机构
[1] Univ Glasgow, Fac Informat & Math Sci, Dept Comp Sci, Inference Grp, Glasgow G12 8QQ, Lanark, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Classification; Kernel combination; MCMC; Probit regression; Bayesian inference; Information integration; BINARY;
D O I
10.1016/j.patrec.2008.08.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we describe a Bayesian classification method that informatively combines diverse sources of information and Multiple feature spaces for multiclasss problems. The proposed method is based on recent advances in kernel approaches where the integration of multiple object descriptors, or feature spaces, is achieved via kernel combination. Each kernel constructs a similarity metric between objects in a particular feature space and then having a common metric across modalities an overall combination can be constructed. We follow a hierarchical Bayesian approach, which introduces prior distributions over random variables and we construct a Gibbs sampling Markov chain Monte Carlo (MCMC) solution which is naturally derived from the employed multinomial probit likelihood. The methodology is the basis for possible deterministic approximations Such as variational or maximum-a-posteriori estimators, and it is compared against the well-known classifier combination methods on the classification of handwritten numerals. The results of the proposed method show a significant improvement over the best individual classifier and match the performance of the best Multiple classifier combination, whilst reducing the Computational requirements of combining classifiers and offering additional information on the significance of the contributing Sources. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:46 / 54
页数:9
相关论文
共 31 条
  • [1] BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA
    ALBERT, JH
    CHIB, S
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) : 669 - 679
  • [2] Bellman R., 1961, Adaptive Control Processes: A Guided Tour, DOI DOI 10.1515/9781400874668
  • [3] Berger J. O., 1985, Statistical decision theory and Bayesian analysis, V2nd
  • [4] Bishop C. M., 2009, Pattern Recognition and Machine Learning
  • [5] Probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection
    Damoulas, Theodoros
    Girolami, Mark A.
    [J]. BIOINFORMATICS, 2008, 24 (10) : 1264 - 1270
  • [6] Denison DavidG. T., 2002, WILEY SERIES PROBABI
  • [7] Ensemble methods in machine learning
    Dietterich, TG
    [J]. MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 : 1 - 15
  • [8] Gelman A., 2004, Bayesian Data Analysis, Vsecond
  • [9] Girolami M., 2005, Proceedings of the 22nd International Conference on Machine Learning, P241, DOI DOI 10.1145/1102351.1102382
  • [10] GIROLAMI M, 2007, 20 ANN C NEUR INF PR