Focal and efficient IOU loss for accurate bounding box regression

被引:822
作者
Zhang, Yi-Fan [1 ,2 ,3 ]
Ren, Weiqiang [4 ]
Zhang, Zhang [1 ,2 ,3 ]
Jia, Zhen [1 ,2 ]
Wang, Liang [1 ,2 ,3 ]
Tan, Tieniu [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci CASIA, Inst Automat, CRIPAC, Beijing, Peoples R China
[2] Chinese Acad Sci CASIA, Inst Automat, NLPR, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Horizon Robot, Beijing, Peoples R China
关键词
Object detection; Loss function design; Hard sample mining;
D O I
10.1016/j.neucom.2022.07.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In object detection, bounding box regression (BBR) is a crucial step that determines the object localization performance. However, we find that most previous loss functions for BBR have two main drawbacks: (i) Both `n-norm and IOU-based loss functions are inefficient to depict the objective of BBR, which leads to slow convergence and inaccurate regression results. (ii) Most of the loss functions ignore the imbalance problem in BBR that the large number of anchor boxes which have small overlaps with the target boxes contribute most to the optimization of BBR. To mitigate the adverse effects caused thereby, we perform thorough studies to exploit the potential of BBR losses in this paper. Firstly, an Efficient Intersection over Union (EIOU) loss is proposed, which explicitly measures the discrepancies of three geometric factors in BBR, i.e., the overlap area, the central point and the side length. After that, we state the Effective Example Mining (EEM) problem and propose a regression version of focal loss to make the regression process focus on high-quality anchor boxes. Finally, the above two parts are combined to obtain a new loss function, namely Focal-EIOU loss. Extensive experiments on both synthetic and real datasets are performed. Notable superiorities on both the convergence speed and the localization accuracy can be achieved over other BBR losses. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:146 / 157
页数:12
相关论文
共 32 条
  • [1] [Anonymous], 2020, ECCV
  • [2] Beery S, 2020, PROC CVPR IEEE, P13072, DOI 10.1109/CVPR42600.2020.01309
  • [3] Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
  • [4] Hierarchical Shot Detector
    Cao, Jiale
    Pang, Yanwei
    Han, Jungong
    Li, Xuelong
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9704 - 9713
  • [5] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [6] AP-Loss for Accurate One-Stage Object Detection
    Chen, Kean
    Lin, Weiyao
    Li, Jianguo
    See, John
    Wang, Ji
    Zou, Junni
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) : 3782 - 3798
  • [7] Chen Zhiming, 2020, PIOU LOSS ACCURATE O
  • [8] Object Detection with Discriminatively Trained Part-Based Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    Ramanan, Deva
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645
  • [9] LocNet: Improving Localization Accuracy for Object Detection
    Gidaris, Spyros
    Komodakis, Nikos
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 789 - 798
  • [10] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448