共 21 条
- [14] Mask R-CNN. He K M,Gkioxari G,Doll′ar P,Girshick R. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV) . 2017
- [15] MobileViT:Light-weight,General-purpose,and Mobile-friendly Vision Transformer. Mehta S,Rastegari M. . 2021
- [16] Exploring the limits of weakly supervised pretraining. Mahajan D,Girshick R,Ramanathan V,He K M,Paluri M,Li Y X,et al. Proceedings of the 15th European Conference on Computer Vision (ECCV) . 2018
- [17] Convolutional xformers for vision. Jeevan P,Sethi A. . 2022
- [18] Masked-attention mask transformer for universal image segmentation. Cheng B W,Misra I,Schwing A G,Kirillov A,Girdhar R. . 2021
- [19] Demystifying local vision transformer:Sparse connectivity,weight sharing and dynamic weight. Han Q,Fan Z J,Dai Q,Sun L,Cheng M M,Liu J Y,et al. . 2021
- [20] Unified perceptual parsing for scene understanding. Xiao T T,Liu Y C,Zhou B L,Jiang Y N,Sun J. Proceedings of the15th European Conference on Computer Vision(ECCV) . 2018