Boundary-aware cascade networks for temporal action segmentation Z Wang, Z Gao, L Wang, Z Li, G Wu Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 160 | 2020 |
Lip: Local importance-based pooling Z Gao, L Wang, G Wu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 152 | 2019 |
Adamixer: A fast-converging query-based object detector Z Gao, L Wang, B Han, S Guo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 131 | 2022 |
Mutual supervision for dense object detection Z Gao, L Wang, G Wu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 41 | 2021 |
Stmixer: A one-stage sparse action detector T Wu, M Cao, Z Gao, G Wu, L Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 29 | 2023 |
VideoLLM-online: Online Video Large Language Model for Streaming Video J Chen, Z Lv, S Wu, KQ Lin, C Song, D Gao, JW Liu, Z Gao, D Mao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 13 | 2024 |
Sparseformer: Sparse visual recognition via limited latent tokens Z Gao, Z Tong, L Wang, MZ Shou arXiv preprint arXiv:2304.03768, 2023 | 7 | 2023 |
One token to seg them all: Language instructed reasoning segmentation in videos Z Bai, T He, H Mei, P Wang, Z Gao, J Chen, L Liu, Z Zhang, MZ Shou arXiv preprint arXiv:2409.19603, 2024 | 3 | 2024 |
Learning video context as interleaved multimodal sequences KQ Lin, P Zhang, D Gao, X Xia, J Chen, Z Gao, J Xie, X Xiao, MZ Shou European Conference on Computer Vision, 375-396, 2025 | 2 | 2025 |
Factorized Visual Tokenization and Generation Z Bai, J Gao, Z Gao, P Wang, Z Zhang, T He, MZ Shou arXiv preprint arXiv:2411.16681, 2024 | | 2024 |
Bootstrapping SparseFormers from Vision Foundation Models Z Gao, Z Tong, KQ Lin, J Chen, MZ Shou Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |