Swin transformer: Hierarchical vision transformer using shifted windows Z Liu, Y Lin, Y Cao, H Hu, Y Wei, Z Zhang, S Lin, B Guo Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 7418 | 2021 |
Swin transformer v2: Scaling up capacity and resolution Z Liu, H Hu, Y Lin, Z Yao, Z Xie, Y Wei, J Ning, Y Cao, Z Zhang, L Dong, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 426 | 2022 |
Simmim: A simple framework for masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, J Bao, Z Yao, Q Dai, H Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 396 | 2022 |
Propagate yourself: Exploring pixel-level consistency for unsupervised visual representation learning Z Xie, Y Lin, Z Zhang, Y Cao, S Lin, H Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 270 | 2021 |
Negative margin matters: Understanding margin in few-shot classification B Liu, Y Cao, Y Lin, Q Li, Z Zhang, M Long, H Hu Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 227 | 2020 |
Self-supervised learning with swin transformers Z Xie, Y Lin, Z Yao, Z Zhang, Q Dai, Y Cao, H Hu arXiv preprint arXiv:2105.04553, 2021 | 105 | 2021 |
A simple baseline for zero-shot semantic segmentation with pre-trained vision-language model M Xu, Z Zhang, F Wei, Y Lin, Y Cao, H Hu, X Bai arXiv preprint arXiv:2112.14757, 2021 | 53 | 2021 |
Parametric instance classification for unsupervised visual feature learning Y Cao, Z Xie, B Liu, Y Lin, Z Zhang, H Hu Advances in neural information processing systems 33, 15614-15624, 2020 | 49 | 2020 |
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model M Xu, Z Zhang, F Wei, Y Lin, Y Cao, H Hu, X Bai Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 20 | 2022 |
Leveraging batch normalization for vision transformers Z Yao, Y Cao, Y Lin, Z Liu, Z Zhang, H Hu Proceedings of the IEEE/CVF International Conference on Computer Vision, 413-422, 2021 | 14 | 2021 |
On data scaling in masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, Y Wei, Q Dai, H Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 12 | 2023 |
Bootstrap your object detector via mixed training M Xu, Z Zhang, F Wei, Y Lin, Y Cao, S Lin, H Hu, X Bai Advances in Neural Information Processing Systems 34, 11315-11325, 2021 | 5 | 2021 |
Could Giant Pretrained Image Models Extract Universal Representations? Y Lin, Z Liu, Z Zhang, H Hu, N Zheng, S Lin, Y Cao arXiv preprint arXiv:2211.02043, 2022 | 1 | 2022 |
A Simple Approach and Benchmark for 21,000-Category Object Detection Y Lin, C Li, Y Cao, Z Zhang, J Wang, L Wang, Z Liu, H Hu Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | | 2022 |
Supplementary Materials for SimMIM: A Simple Framework for Masked Image Modeling Z Xie, Z Zhang, Y Cao, Y Lin, J Bao, Z Yao, Q Dai, H Hu | | |