AP-Loss for Accurate One-Stage Object Detection

被引：54

作者：

Chen, Kean ^{[1
]}

Lin, Weiyao ^{[1
]}

Li, Jianguo ^{[2
]}

See, John ^{[3
]}

Wang, Ji ^{[4
]}

Zou, Junni ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

[2] Intel Labs, Beijing 100080, Peoples R China

[3] Multimedia Univ, Fac Comp & Informat, Cyberjaya 63100, Selangor, Malaysia

[4] Tencent YouTu Lab, Shanghai 200233, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2021年 / 43卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Detectors; Task analysis; Measurement; Optimization; Object detection; Training; Proposals; Computer vision; object detection; machine learning; ranking loss;

D O I：

10.1109/TPAMI.2020.2991457

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

One-stage object detectors are trained by optimizing classification-loss and localization-loss simultaneously, with the former suffering much from extreme foreground-background class imbalance issue due to the large number of anchors. This paper alleviates this issue by proposing a novel framework to replace the classification task in one-stage detectors with a ranking task, and adopting the average-precision loss (AP-loss) for the ranking problem. Due to its non-differentiability and non-convexity, the AP-loss cannot be optimized directly. For this purpose, we develop a novel optimization algorithm, which seamlessly combines the error-driven update scheme in perceptron learning and backpropagation algorithm in deep networks. We provide in-depth analyses on the good convergence property and computational complexity of the proposed algorithm, both theoretically and empirically. Experimental results demonstrate notable improvement in addressing the imbalance issue in object detection over existing AP-based optimization algorithms. An improved state-of-the-art performance is achieved in one-stage detectors based on AP-loss over detectors using classification-losses on various standard benchmarks. The proposed framework is also highly versatile in accommodating different network architectures. Code is available at https://github.com/cccorn/AP-loss.

引用

页码：3782 / 3798

页数：17

共 67 条

[21] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[22] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[23] End-to-End Training of Object Class Detectors for Mean Average Precision
Henderson, Paul
Ferrari, Vittorio
[J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 198 - 213
[24] Joseph RK, 2016, CRIT POL ECON S ASIA, P1
[25] Parallel Feature Pyramid Network for Object Detection
Kim, Seung-Wook
Kook, Hyong-Keun
Sun, Jee-Young
Kang, Mun-Cheon
Ko, Sung-Jea
[J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 239 - 256
[26] LEARNING ALGORITHMS WITH OPTIMAL STABILITY IN NEURAL NETWORKS
KRAUTH, W
MEZARD, M
[J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1987, 20 (11): : L745 - L752
[27] CornerNet: Detecting Objects as Paired Keypoints
Law, Hei
Deng, Jia
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (03) : 642 - 656
[28] LeCun Y., 2013, INT C LEARN REPR ICL, V1312, P6229
[29] Li B., 2018, ARXIV180702842
[30] Li BY, 2019, AAAI CONF ARTIF INTE, P8577

← 1 2 3 4 5 6 7 →