AP-Loss for Accurate One-Stage Object Detection

被引:54
作者
Chen, Kean [1 ]
Lin, Weiyao [1 ]
Li, Jianguo [2 ]
See, John [3 ]
Wang, Ji [4 ]
Zou, Junni [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Intel Labs, Beijing 100080, Peoples R China
[3] Multimedia Univ, Fac Comp & Informat, Cyberjaya 63100, Selangor, Malaysia
[4] Tencent YouTu Lab, Shanghai 200233, Peoples R China
基金
中国国家自然科学基金;
关键词
Detectors; Task analysis; Measurement; Optimization; Object detection; Training; Proposals; Computer vision; object detection; machine learning; ranking loss;
D O I
10.1109/TPAMI.2020.2991457
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One-stage object detectors are trained by optimizing classification-loss and localization-loss simultaneously, with the former suffering much from extreme foreground-background class imbalance issue due to the large number of anchors. This paper alleviates this issue by proposing a novel framework to replace the classification task in one-stage detectors with a ranking task, and adopting the average-precision loss (AP-loss) for the ranking problem. Due to its non-differentiability and non-convexity, the AP-loss cannot be optimized directly. For this purpose, we develop a novel optimization algorithm, which seamlessly combines the error-driven update scheme in perceptron learning and backpropagation algorithm in deep networks. We provide in-depth analyses on the good convergence property and computational complexity of the proposed algorithm, both theoretically and empirically. Experimental results demonstrate notable improvement in addressing the imbalance issue in object detection over existing AP-based optimization algorithms. An improved state-of-the-art performance is achieved in one-stage detectors based on AP-loss over detectors using classification-losses on various standard benchmarks. The proposed framework is also highly versatile in accommodating different network architectures. Code is available at https://github.com/cccorn/AP-loss.
引用
收藏
页码:3782 / 3798
页数:17
相关论文
共 67 条
  • [1] THE ADATRON - AN ADAPTIVE PERCEPTRON ALGORITHM
    ANLAUF, JK
    BIEHL, M
    [J]. EUROPHYSICS LETTERS, 1989, 10 (07): : 687 - 692
  • [2] [Anonymous], 2016, International Conference on Machine Learning
  • [3] [Anonymous], 2017, PROC NEURIPS MACH LE
  • [4] Multiscale Combinatorial Grouping
    Arbelaez, Pablo
    Pont-Tuset, Jordi
    Barron, Jonathan T.
    Marques, Ferran
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 328 - 335
  • [5] Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
    Bell, Sean
    Zitnick, C. Lawrence
    Bala, Kavita
    Girshick, Ross
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2874 - 2883
  • [6] CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts
    Carreira, Joao
    Sminchisescu, Cristian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (07) : 1312 - 1328
  • [7] Chen BB, 2018, ADV SOC SCI EDUC HUM, V181, P453
  • [8] Towards Accurate One-Stage Object Detection with AP-Loss
    Chen, Kean
    Li, Jianguo
    Lin, Weiyao
    See, John
    Wang, Ji
    Duan, Lingyu
    Chen, Zhibo
    He, Changwei
    Zou, Junni
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5114 - 5122
  • [9] Cortes C, 2004, ADV NEUR IN, V16, P313
  • [10] Combining Ranking with Traditional Methods for Ordinal Class Imbalance
    Cruz, Ricardo
    Fernandes, Kelwin
    Pinto Costa, Joaquim F.
    Perez Ortiz, Marfa
    Cardoso, Jaime S.
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2017, PT II, 2017, 10306 : 538 - 548