AP-Loss for Accurate One-Stage Object Detection

被引:54
作者
Chen, Kean [1 ]
Lin, Weiyao [1 ]
Li, Jianguo [2 ]
See, John [3 ]
Wang, Ji [4 ]
Zou, Junni [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Intel Labs, Beijing 100080, Peoples R China
[3] Multimedia Univ, Fac Comp & Informat, Cyberjaya 63100, Selangor, Malaysia
[4] Tencent YouTu Lab, Shanghai 200233, Peoples R China
基金
中国国家自然科学基金;
关键词
Detectors; Task analysis; Measurement; Optimization; Object detection; Training; Proposals; Computer vision; object detection; machine learning; ranking loss;
D O I
10.1109/TPAMI.2020.2991457
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One-stage object detectors are trained by optimizing classification-loss and localization-loss simultaneously, with the former suffering much from extreme foreground-background class imbalance issue due to the large number of anchors. This paper alleviates this issue by proposing a novel framework to replace the classification task in one-stage detectors with a ranking task, and adopting the average-precision loss (AP-loss) for the ranking problem. Due to its non-differentiability and non-convexity, the AP-loss cannot be optimized directly. For this purpose, we develop a novel optimization algorithm, which seamlessly combines the error-driven update scheme in perceptron learning and backpropagation algorithm in deep networks. We provide in-depth analyses on the good convergence property and computational complexity of the proposed algorithm, both theoretically and empirically. Experimental results demonstrate notable improvement in addressing the imbalance issue in object detection over existing AP-based optimization algorithms. An improved state-of-the-art performance is achieved in one-stage detectors based on AP-loss over detectors using classification-losses on various standard benchmarks. The proposed framework is also highly versatile in accommodating different network architectures. Code is available at https://github.com/cccorn/AP-loss.
引用
收藏
页码:3782 / 3798
页数:17
相关论文
共 67 条
  • [21] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [22] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [23] End-to-End Training of Object Class Detectors for Mean Average Precision
    Henderson, Paul
    Ferrari, Vittorio
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 198 - 213
  • [24] Joseph RK, 2016, CRIT POL ECON S ASIA, P1
  • [25] Parallel Feature Pyramid Network for Object Detection
    Kim, Seung-Wook
    Kook, Hyong-Keun
    Sun, Jee-Young
    Kang, Mun-Cheon
    Ko, Sung-Jea
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 239 - 256
  • [26] LEARNING ALGORITHMS WITH OPTIMAL STABILITY IN NEURAL NETWORKS
    KRAUTH, W
    MEZARD, M
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1987, 20 (11): : L745 - L752
  • [27] CornerNet: Detecting Objects as Paired Keypoints
    Law, Hei
    Deng, Jia
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (03) : 642 - 656
  • [28] LeCun Y., 2013, INT C LEARN REPR ICL, V1312, P6229
  • [29] Li B., 2018, ARXIV180702842
  • [30] Li BY, 2019, AAAI CONF ARTIF INTE, P8577