Active Object Localization with Deep Reinforcement Learning

被引:309
作者
Caicedo, Juan C. [1 ]
Lazebnik, Svetlana [2 ]
机构
[1] Fdn Univ Konrad Lorenz, Bogota, Colombia
[2] Univ Illinois, Urbana, IL USA
来源
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015年
关键词
D O I
10.1109/ICCV.2015.286
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
We present an active detection model for localizing objects in scenes. The model is class-specific and allows an agent to focus attention on candidate regions for identifying the correct location of a target object. This agent learns to deform a bounding box using simple transformation actions, with the goal of determining the most specific location of target objects following top-down reasoning. The proposed localization agent is trained using deep reinforcement learning, and evaluated on the Pascal VOC 2007 dataset. We show that agents guided by the proposed model are able to localize a single instance of an object after analyzing only between 11 and 25 regions in an image, and obtain the best detection results among systems that do not use object proposals for object localization.
引用
收藏
页码:2488 / 2496
页数:9
相关论文
共 38 条
[1]
Abbeel P., 2004, P 21 INT C MACH LEAR
[2]
[Anonymous], 2014, arXiv
[3]
[Anonymous], 2015, J. Mach. Learn. Res.
[4]
Correlational spectral clustering [J].
Blaschko, Matthew B. ;
Lampert, Christoph H. .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :93-+
[5]
State-of-the-Art in Visual Attention Modeling [J].
Borji, Ali ;
Itti, Laurent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :185-207
[6]
Chavali N., 2015, ARXIV1505055836
[7]
BING: Binarized Normed Gradients for Objectness Estimation at 300fps [J].
Cheng, Ming-Ming ;
Zhang, Ziming ;
Lin, Wen-Yan ;
Torr, Philip .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3286-3293
[8]
Coates A., 2008, ICML
[9]
Divvala SK, 2009, PROC CVPR IEEE, P1271, DOI 10.1109/CVPRW.2009.5206532
[10]
Learning Collections of Part Models for Object Recognition [J].
Endres, Ian ;
Shih, Kevin J. ;
Jiaa, Johnston ;
Hoiem, Derek .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :939-946