MLP-STEREO: HETEROGENEOUS FEATURE FUSION IN MLP FOR STEREO MATCHING

被引:2
作者
Ye, Shuiqiang [1 ]
Zeng, Pengcheng [1 ]
Li, Pengfei [1 ]
Wang, Weiqi [1 ]
Xinan, Wang [1 ]
Zhao, Yong [1 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2022年
关键词
Stereo matching; MLP; cost aggregation; inductive bias; real-time network;
D O I
10.1109/ICIP46576.2022.9897348
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
CNNs' strong inductive biases of locality and weight sharing provide powerful representation ability and data sample utilization efficiency. However, the weight sharing might smooth out the discrepancy between similar pixels, resulting in the wrong matching between left and right camera-image pair in thin structures region and repetitive texture region. In this paper, we propose a novel Heterogeneous Feature Fusion in MLP (HFF-MLP) for Stereo matching. It employs MLP structure and relaxes the weights sharing in the local spatial region. To this end, pixels in thin structures region and repetitive texture region are dealt with independently using the exclusive weights. Based on HFF-MLP module, we design a real-time network, i.e., MLP-Stereo. Experimental results show that our proposed HFF-MLP achieves competitive results on KITTI 2015 test dataset with the running time of 46 milliseconds. Furthermore, it performs much better than other real-time network in thin structures region and repetitive texture region.
引用
收藏
页码:101 / 105
页数:5
相关论文
共 19 条
[1]
Correlate-and-Excite: Real-Time Stereo Matching via Guided Cost Volume Excitation [J].
Bangunharcana, Antyanta ;
Cho, Jae Won ;
Lee, Seokju ;
Kweon, In So ;
Kim, Kyung-Soo ;
Kim, Soohyun .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :3542-3548
[2]
Pyramid Stereo Matching Network [J].
Chang, Jia-Ren ;
Chen, Yong-Sheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418
[3]
Learning Depth with Convolutional Spatial Propagation Network [J].
Cheng, Xinjing ;
Wang, Peng ;
Yang, Ruigang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (10) :2361-2379
[4]
Cheng XL, 2020, Arxiv, DOI arXiv:2010.13501
[5]
d'Ascoli S, 2021, Arxiv, DOI [arXiv:2103.10697, DOI 10.48550/ARXIV.2103.10697, 10.1088/1742-5468/ac9830]
[6]
Group-wise Correlation Stereo Network [J].
Guo, Xiaoyang ;
Yang, Kai ;
Yang, Wukui ;
Wang, Xiaogang ;
Li, Hongsheng .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3268-3277
[7]
StereoNet: Guided Hierarchical Refinement for Real-Time Edge-Aware Depth Prediction [J].
Khamis, Sameh ;
Fanello, Sean ;
Rhemann, Christoph ;
Kowdle, Adarsh ;
Valentin, Julien ;
Izadi, Shahram .
COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 :596-613
[8]
Lian D., 2021, arXiv
[9]
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].
Liu, Ze ;
Lin, Yutong ;
Cao, Yue ;
Hu, Han ;
Wei, Yixuan ;
Zhang, Zheng ;
Lin, Stephen ;
Guo, Baining .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002
[10]
Cascade Residual Learning: A Two-stage Convolutional Neural Network for Stereo Matching [J].
Pang, Jiahao ;
Sun, Wenxiu ;
Ren, Jimmy S. J. ;
Yang, Chengxi ;
Yan, Qiong .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :878-886