The More You Know: Using Knowledge Graphs for Image Classification

被引:177
作者
Marino, Kenneth [1 ]
Salakhutdinov, Ruslan [1 ]
Gupta, Abhinav [1 ]
机构
[1] Carnegie Mellon Univ, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2017.10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One characteristic that sets humans apart from modern learning-based computer vision algorithms is the ability to acquire knowledge about the world and use that knowledge to reason about the visual world. Humans can learn about the characteristics of objects and the relationships that occur between them to learn a large variety of visual concepts, often with few examples. This paper investigates the use of structured prior knowledge in the form of knowledge graphs and shows that using this knowledge improves performance on image classification. We build on recent work on end-to-end learning on graphs, introducing the Graph Search Neural Network as a way of efficiently incorporating large knowledge graphs into a vision classification pipeline. We show in a number of experiments that our method outperforms standard neural network baselines for multi-label classification.
引用
收藏
页码:20 / 28
页数:9
相关论文
共 39 条
  • [1] [Anonymous], The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results
  • [2] [Anonymous], 2014, EUR C COMP VIS
  • [3] [Anonymous], 2016, ICLR
  • [4] Borgwardt K. M., 2005, 5 IEEE INT C DATA MI, DOI DOI 10.1109/ICDM.2005.132
  • [5] Bruna J., 2013, ARXIV
  • [6] Carlson A., 2010, AAAI, V5, P3
  • [7] Chen X., 2013, CVPR
  • [8] Duan K., 2012, CVPR
  • [9] Duvenaud D.K., 2015, Advances in neural information processing Systems (NIPS), P2224, DOI DOI 10.48550/ARXIV.1509.09292
  • [10] Farhadi A., 2009, CVPR