GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

被引:1389
作者
Cao, Yue [1 ,3 ]
Xu, Jiarui [2 ,3 ]
Lin, Stephen [3 ]
Wei, Fangyun [3 ]
Hu, Han [3 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) | 2019年
关键词
D O I
10.1109/ICCVW.2019.00246
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
The Non-Local Network (NLNet) presents a pioneering approach for capturing long-range dependencies, via aggregating query-specific global context to each query position. However, through a rigorous empirical analysis, we have found that the global contexts modeled by non-local network are almost the same for different query positions within an image. In this paper, we take advantage of this finding to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation. We further observe that this simplified design shares similar structure with Squeeze-Excitation Network (SENet). Hence we unify them into a three-step general framework for global context modeling. Within the general framework, we design a better instantiation, called the global context (GC) block, which is lightweight and can effectively model the global context. The lightweight property allows us to apply it for multiple layers in a backbone network to construct a global context network (GCNet), which generally outperforms both simplified NLNet and SENet on major benchmarks for various recognition tasks.
引用
收藏
页码:1971 / 1980
页数:10
相关论文
共 42 条
[1]
[Anonymous], 2017, ARXIV170404861
[2]
[Anonymous], 2016, BMVC
[3]
[Anonymous], 2018, P ECCV
[4]
[Anonymous], 2017, IEEE C COMP VIS PATT
[5]
Carreira J., 2017, NEW MODEL KINETICS D, P6299
[6]
Chen Kai, 2019, arXiv preprint arXiv:1906.07155
[7]
Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[8]
Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[9]
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]
Feichtenhofer Christoph, 2018, SLOWFAST NETWORKS VI