GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

被引：1389

作者：

Cao, Yue ^{[1
,3
]}

Xu, Jiarui ^{[2
,3
]}

Lin, Stephen ^{[3
]}

Wei, Fangyun ^{[3
]}

Hu, Han ^{[3
]}

机构：

[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China

[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China

[3] Microsoft Res Asia, Beijing, Peoples R China

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) | 2019年

关键词：

D O I：

10.1109/ICCVW.2019.00246

中图分类号：

TP18 [人工智能理论];

学科分类号：

140502 [人工智能];

摘要：

The Non-Local Network (NLNet) presents a pioneering approach for capturing long-range dependencies, via aggregating query-specific global context to each query position. However, through a rigorous empirical analysis, we have found that the global contexts modeled by non-local network are almost the same for different query positions within an image. In this paper, we take advantage of this finding to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation. We further observe that this simplified design shares similar structure with Squeeze-Excitation Network (SENet). Hence we unify them into a three-step general framework for global context modeling. Within the general framework, we design a better instantiation, called the global context (GC) block, which is lightweight and can effectively model the global context. The lightweight property allows us to apply it for multiple layers in a backbone network to construct a global context network (GCNet), which generally outperforms both simplified NLNet and SENet on major benchmarks for various recognition tasks.

引用

页码：1971 / 1980

页数：10

共 42 条

[1]

[Anonymous], 2017, ARXIV170404861

[2]

[Anonymous], 2016, BMVC

[3]

[Anonymous], 2018, P ECCV

[4]

[Anonymous], 2017, IEEE C COMP VIS PATT

[5]

Carreira J., 2017, NEW MODEL KINETICS D, P6299

[6]

Chen Kai, 2019, arXiv preprint arXiv:1906.07155

[7]

Xception: Deep Learning with Depthwise Separable Convolutions [J].

Chollet, Francois .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807

[8]

Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[9]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[10]

Feichtenhofer Christoph, 2018, SLOWFAST NETWORKS VI

← 1 2 3 4 5 →