Monocular 3D Object Detection for Autonomous Driving

被引：628

作者：

Chen, Xiaozhi ^{[1
]}

Kundu, Kaustav ^{[2
]}

Zhang, Ziyu ^{[2
]}

Ma, Huimin ^{[1
]}

Fidler, Sanja ^{[2
]}

Urtasun, Raquel ^{[2
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

[2] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada

来源：

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年

基金：

加拿大自然科学与工程研究理事会;

关键词：

D O I：

10.1109/CVPR.2016.236

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The goal of this paper is to perform 3D object detection from a single monocular image in the domain of autonomous driving. Our method first aims to generate a set of candidate class-specific object proposals, which are then run through a standard CNN pipeline to obtain high-quality object detections. The focus of this paper is on proposal generation. In particular, we propose an energy minimization approach that places object candidates in 3D using the fact that objects should be on the ground-plane. We then score each candidate box projected to the image plane via several intuitive potentials encoding semantic segmentation, contextual information, size and location priors and typical object shape. Our experimental evaluation demonstrates that our object proposal generation approach significantly outperforms all monocular approaches, and achieves the best detection performance on the challenging KITTI benchmark, among published monocular competitors.

引用

页码：2147 / 2156

页数：10

共 55 条

[1] [Anonymous], ABS13127715 CORR
[2] [Anonymous], The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results
[3] [Anonymous], PAMI
[4] [Anonymous], 2015, ARXIV150205082
[5] [Anonymous], 2013, ICRA
[6] Multiscale Combinatorial Grouping
Arbelaez, Pablo
Pont-Tuset, Jordi
Barron, Jonathan T.
Marques, Ferran
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 328 - 335
[7] Badrinarayanan V., 2015, SEGNET DEEP CONVOLUT, DOI DOI 10.1109/TPAMI.2016.2644615
[8] Benenson R., 2013, CVPR 2013
[9] Cadena C., 2015, ICRA
[10] Learning Complexity-Aware Cascades for Deep Pedestrian Detection
Cai, Zhaowei
Saberian, Mohammad
Vasconcelos, Nuno
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3361 - 3369

← 1 2 3 4 5 6 →