LabelMe: A database and web-based tool for image annotation

被引:2472
作者
Russell, Bryan C. [1 ]
Torralba, Antonio [1 ]
Murphy, Kevin P. [2 ]
Freeman, William T. [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] Univ British Columbia, Dept Comp Sci & Stat, Vancouver, BC V6T 1Z4, Canada
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
database; annotation tool; object recognition; object detection;
D O I
10.1007/s11263-007-0090-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We seek to build a large collection of images with ground truth labels to be used for object detection and recognition research. Such data is useful for supervised learning and quantitative evaluation. To achieve this, we developed a web-based tool that allows easy image annotation and instant sharing of such annotations. Using this annotation tool, we have collected a large dataset that spans many object categories, often containing multiple instances over a wide variety of images. We quantify the contents of the dataset and compare against existing state of the art datasets used for object recognition and detection. Also, we show how to extend the dataset to automatically enhance object labels with WordNet, discover object parts, recover a depth ordering of objects in a scene, and increase the number of labels using minimal user supervision and images from the web.
引用
收藏
页码:157 / 173
页数:17
相关论文
共 44 条
[21]  
FEIFEI L, 2007, IN PRESS IEEE T PATT
[22]  
Fellbaum C, 1998, WORDNET ELECT LEXICA
[23]  
Griffin G., 2006, CALTECH256
[24]  
HEISELE B, 2001, CVPR
[25]  
Hoiem D., 2006, CVPR
[26]  
Ide N, 1998, COMPUT LINGUIST, V24, P1
[27]  
LECUN Y, 2004, P CVPR 04 LOS AL
[28]  
LEIBE B, 2005, THESIS
[29]  
LI Y, 2002, P INT C PATT REC
[30]   Modeling the shape of the scene: A holistic representation of the spatial envelope [J].
Oliva, A ;
Torralba, A .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 42 (03) :145-175