Monocular 3D Object Detection for Autonomous Driving

被引:628
作者
Chen, Xiaozhi [1 ]
Kundu, Kaustav [2 ]
Zhang, Ziyu [2 ]
Ma, Huimin [1 ]
Fidler, Sanja [2 ]
Urtasun, Raquel [2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/CVPR.2016.236
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of this paper is to perform 3D object detection from a single monocular image in the domain of autonomous driving. Our method first aims to generate a set of candidate class-specific object proposals, which are then run through a standard CNN pipeline to obtain high-quality object detections. The focus of this paper is on proposal generation. In particular, we propose an energy minimization approach that places object candidates in 3D using the fact that objects should be on the ground-plane. We then score each candidate box projected to the image plane via several intuitive potentials encoding semantic segmentation, contextual information, size and location priors and typical object shape. Our experimental evaluation demonstrates that our object proposal generation approach significantly outperforms all monocular approaches, and achieves the best detection performance on the challenging KITTI benchmark, among published monocular competitors.
引用
收藏
页码:2147 / 2156
页数:10
相关论文
共 55 条
  • [1] [Anonymous], ABS13127715 CORR
  • [2] [Anonymous], The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results
  • [3] [Anonymous], PAMI
  • [4] [Anonymous], 2015, ARXIV150205082
  • [5] [Anonymous], 2013, ICRA
  • [6] Multiscale Combinatorial Grouping
    Arbelaez, Pablo
    Pont-Tuset, Jordi
    Barron, Jonathan T.
    Marques, Ferran
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 328 - 335
  • [7] Badrinarayanan V., 2015, SEGNET DEEP CONVOLUT, DOI DOI 10.1109/TPAMI.2016.2644615
  • [8] Benenson R., 2013, CVPR 2013
  • [9] Cadena C., 2015, ICRA
  • [10] Learning Complexity-Aware Cascades for Deep Pedestrian Detection
    Cai, Zhaowei
    Saberian, Mohammad
    Vasconcelos, Nuno
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3361 - 3369