Follow
Zheng Ge
Zheng Ge
StepFun
Verified email at fuji.waseda.jp - Homepage
Title
Cited by
Cited by
Year
Yolox: Exceeding yolo series in 2021
JS Ge, Z, S Liu, F Wang, Z Li
arXiv preprint arXiv:2107.08430, 2021
53822021
Bevdepth: Acquisition of reliable depth for multi-view 3d object detection
Y Li, Z Ge, G Yu, J Yang, Z Wang, Y Shi, J Sun, Z Li
Proceedings of the AAAI conference on artificial intelligence, 2022
5612022
Ota: Optimal transport assignment for object detection
Z Ge, S Liu, Z Li, O Yoshie, J Sun
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
5292021
Bevstereo: Enhancing depth estimation in multi-view 3d object detection with dynamic temporal stereo
Y Li, H Bao, Z Ge, J Yang, J Sun, Z Li
Proceedings of the AAAI conference on artificial intelligence, 2022
203*2022
NMS by representative region: Towards crowded pedestrian detection by proposal pairing
X Huang, Z Ge, Z Jie, O Yoshie
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
1952020
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Z Qi, R Dong, G Fan, Z Ge, X Zhang, K Ma, L Yi
International Conference on Machine Learning (ICML), 2023, 2023
1082023
Implicit identity leakage: The stumbling block to improving deepfake detection generalization
S Dong, J Wang, R Ji, J Liang, H Fan, Z Ge
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
1082023
Dreamllm: Synergistic multimodal comprehension and creation
R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei, ...
ICLR 2024 (Spotlight), 2024
1012024
Dense teacher: Dense pseudo-labels for semi-supervised object detection
H Zhou, Z Ge, S Liu, W Mao, Z Li, H Yu, J Sun
Proceedings of the European conference on computer vision (ECCV), 2022
942022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
R Dong, Z Qi, L Zhang, J Zhang, J Sun, Z Ge, L Yi, K Ma
International Conference on Learning Representations (ICLR), 2023, 2022
912022
Vary: Scaling up the vision vocabulary for large vision-language models
H Wei, L Kong, J Chen, L Zhao, Z Ge, J Yang, J Sun, C Han, X Zhang
ECCV 2024, 2024
572024
Exploring recurrent long-term temporal fusion for multi-view 3d perception
C Han, J Yang, J Sun, Z Ge, R Dong, H Zhou, W Mao, Y Peng, X Zhang
RA-L & IROS (Oral), 2024
542024
Sts: Surround-view temporal stereo for multi-view 3d detection
Z Wang, C Min, Z Ge, Y Li, Z Li, H Yang, D Huang
arXiv preprint arXiv:2208.10145, 2022
542022
Lla: Loss-aware label assignment for dense pedestrian detection
Z Ge, J Wang, X Huang, S Liu, O Yoshie
Neurocomputing 462, 272-281, 2021
492021
Ps-rcnn: Detecting secondary human instances in a crowd via primary object suppression
Z Ge, Z Jie, X Huang, R Xu, O Yoshie
2020 IEEE international conference on multimedia and expo (ICME), 1-6, 2020
412020
Chatspot: Bootstrapping multimodal llms via precise referring instruction tuning
L Zhao, E Yu, Z Ge, J Yang, H Wei, H Zhou, J Sun, Y Peng, R Dong, ...
IJCAI 2024 (Long Oral), 2023
402023
Matrixvt: Efficient multi-camera to bev transformation for 3d perception
H Zhou, Z Ge, Z Li, X Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
392023
Shapellm: Universal 3d object understanding for embodied interaction
Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge, L Yi, K Ma
ECCV 2024, 2024
292024
Small language model meets with reinforced vision vocabulary
H Wei, L Kong, J Chen, L Zhao, Z Ge, E Yu, J Sun, C Han, X Zhang
arXiv preprint arXiv:2401.12503, 2024
252024
Align-detr: Improving detr with simple iou-aware bce loss
Z Cai, S Liu, G Wang, Z Ge, X Zhang, D Huang
arXiv preprint arXiv:2304.07527, 2023
242023
The system can't perform the operation now. Try again later.
Articles 1–20