ImageNet Large Scale Visual Recognition Challenge

被引:110
作者
Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael Bernstein
Alexander C. Berg
Li Fei-Fei
机构
[1] Stanford University,
[2] University of Michigan,undefined
[3] Massachusetts Institute of Technology,undefined
[4] UNC Chapel Hill,undefined
来源
International Journal of Computer Vision | 2015年 / 115卷
关键词
Dataset; Large-scale; Benchmark; Object recognition; Object detection;
D O I
暂无
中图分类号
学科分类号
摘要
The ImageNet Large Scale Visual Recognition Challenge is a benchmark in object category classification and detection on hundreds of object categories and millions of images. The challenge has been run annually from 2010 to present, attracting participation from more than fifty institutions. This paper describes the creation of this benchmark dataset and the advances in object recognition that have been possible as a result. We discuss the challenges of collecting large-scale ground truth annotation, highlight key breakthroughs in categorical object recognition, provide a detailed analysis of the current state of the field of large-scale image classification and object detection, and compare the state-of-the-art computer vision accuracy with human accuracy. We conclude with lessons learned in the 5 years of the challenge, and propose future directions and improvements.
引用
收藏
页码:211 / 252
页数:41
相关论文
共 58 条
  • [1] Ahonen T(2006)Face description with local binary patterns: Application to face recognition Pattern Analysis and Machine Intelligence 28 2037-2041
  • [2] Hadid A(2012)Measuring the objectness of image windows IEEE Transactions on Pattern Analysis and Machine Intelligence 34 2189-2202
  • [3] Pietikinen M(2011)Contour detection and hierarchical image segmentation IEEE Transaction on Pattern Analysis and Machine Intelligence 33 898-916
  • [4] Alexe B(2006)Online passive-aggressive algorithms Journal of Machine Learning Research 7 551-585
  • [5] Deselares T(2010)The Pascal Visual Object Classes (VOC) challenge International Journal of Computer Vision 88 303-338
  • [6] Ferrari V(2014)The Pascal Visual Object Classes (VOC) challenge—A retrospective International Journal of Computer Vision 111 98-136
  • [7] Arbelaez P(2010)Object detection with discriminatively trained part based models IEEE Transactions on Pattern Analysis and Machine Intelligence 32 1627-1645
  • [8] Maire M(2013)Vision meets robotics: The kitti dataset International Journal of Robotics Research 32 1231-1237
  • [9] Fowlkes C(2011)Nonparametric scene parsing via label transfer IEEE Transactions on Pattern Analysis and Machine Intelligence 32 2368-2382
  • [10] Malik J(2004)Distinctive image features from scale-invariant keypoints International Journal of Computer Vision 60 91-110