Dirichlet-based Histogram Feature Transform for Image Classification

被引:54
作者
Kobayashi, Takumi [1 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki, Japan
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
关键词
D O I
10.1109/CVPR.2014.413
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Histogram-based features have significantly contributed to recent development of image classifications, such as by SIFT local descriptors. In this paper, we propose a method to efficiently transform those histogram features for improving the classification performance. The (L1-normalized) histogram feature is regarded as a probability mass function, which is modeled by Dirichlet distribution. Based on the probabilistic modeling, we induce the Dirichlet Fisher kernel for transforming the histogram feature vector. The method works on the individual histogram feature to enhance the discriminative power at a low computational cost. On the other hand, in the bag-of-feature (BoF) framework, the Dirichlet mixture model can be extended to Gaussian mixture by transforming histogram-based local descriptors, e.g., SIFT, and thereby we propose the method of Dirichlet-derived GMM Fisher kernel. In the experiments on diverse image classification tasks including recognition of subordinate objects and material textures, the proposed methods improve the performance of the histogram-based features and BoF-based Fisher kernel, being favorably competitive with the state-of-the-arts.
引用
收藏
页码:3278 / 3285
页数:8
相关论文
共 47 条
[11]  
[Anonymous], 2010, CVPR
[12]  
[Anonymous], ECCV
[13]  
[Anonymous], ICCV
[14]  
[Anonymous], 2012, CVPR
[15]  
[Anonymous], 2012, ECCV
[16]  
[Anonymous], CVPR
[17]  
[Anonymous], CVPR
[18]  
[Anonymous], 2009, J VISUAL-JAPAN, DOI [DOI 10.1167/9.8.784, 10.1167/9.8.784]
[19]  
[Anonymous], NIPS
[20]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893