Combining feature spaces for classification

被引:51
作者
Damoulas, Theodoros [1 ]
Girolami, Mark A. [1 ]
机构
[1] Univ Glasgow, Dept Comp Sci, Fac Informat & Math Sci, Inference Res Grp, Glasgow G12 8QQ, Lanark, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Variational Bayes approximation; Multiclass classification; Kernel combination; Hierarchical Bayes; Bayesian inference; Ensemble learning; Multi-modal modelling; Information integration; PROTEIN FOLD RECOGNITION; EMPIRICAL-ANALYSIS; KERNEL;
D O I
10.1016/j.patcog.2009.04.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we offer a variational Bayes approximation to the multinomial probit model for basis expansion and kernel combination. Our model is well-founded within a hierarchical Bayesian framework and is able to instructively combine available sources of information for multinomial classification. The proposed framework enables informative integration of possibly heterogeneous Sources in a Multitude of ways, from the simple Summation of feature expansions to weighted product of kernels, and it is shown to match and in certain cases outperform the well-known ensemble learning approaches of combining individual classifiers. At the same time the approximation reduces considerably the CPU time and resources required with respect to both the ensemble learning methods and the full Markov chain Monte Carlo, Metropolis-Hastings within Gibbs solution of our model. We present our proposed framework together with extensive experimental studies on synthetic and benchmark datasets and also for the first time report a comparison between summation and product of individual kernels as possible different methods for constructing the composite kernel Matrix. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2671 / 2683
页数:13
相关论文
共 36 条
  • [1] BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA
    ALBERT, JH
    CHIB, S
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) : 669 - 679
  • [2] An introduction to MCMC for machine learning
    Andrieu, C
    de Freitas, N
    Doucet, A
    Jordan, MI
    [J]. MACHINE LEARNING, 2003, 50 (1-2) : 5 - 43
  • [3] BAI L, 2003, P 23 ART INT C
  • [4] Beal M. J., 2003, VARIATIONAL ALGORITH
  • [5] Berger J. O., 1985, Statistical decision theory and Bayesian analysis, V2nd
  • [6] Bishop C., 2006, BOOK REV PATTERNRECO, DOI DOI 10.1117/1.2819119
  • [7] DAMOULAS T, 2008, IEEE INT C MACH LEAR
  • [8] Probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection
    Damoulas, Theodoros
    Girolami, Mark A.
    [J]. BIOINFORMATICS, 2008, 24 (10) : 1264 - 1270
  • [9] Pattern recognition with a Bayesian kernel combination machine
    Damoulas, Theodoros
    Girolami, Mark A.
    [J]. PATTERN RECOGNITION LETTERS, 2009, 30 (01) : 46 - 54
  • [10] DEFREITAS N, 2001, P 17 C UNC ART INT