Follow
Peng Gao
Peng Gao
Shanghai AI Lab
Verified email at pjlab.org.cn - Homepage
Title
Cited by
Cited by
Year
Clip-adapter: Better vision-language models with feature adapters
P Gao, S Geng, R Zhang, T Ma, R Fang, Y Zhang, H Li, Y Qiao
International Journal of Computer Vision, 2021
4102021
Dynamic fusion with intra-and inter-modality attention flow for visual question answering
P Gao, Z Jiang, H You, P Lu, SCH Hoi, X Wang, H Li
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
383*2019
Uniformer: Unified transformer for efficient spatiotemporal representation learning
K Li, Y Wang, P Gao, G Song, Y Liu, H Li, Y Qiao
arXiv preprint arXiv:2201.04676, 2022
338*2022
Tip-adapter: Training-free clip-adapter for better vision-language modeling
R Zhang, R Fang, W Zhang, P Gao, K Li, J Dai, Y Qiao, H Li
arXiv preprint arXiv:2111.03930, 2021
324*2021
Llama-adapter: Efficient fine-tuning of language models with zero-init attention
R Zhang, J Han, A Zhou, X Hu, S Yan, P Lu, H Li, P Gao, Y Qiao
arXiv preprint arXiv:2303.16199, 2023
2532023
Fast convergence of detr with spatially modulated co-attention
P Gao, M Zheng, X Wang, J Dai, H Li
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
2482021
Pointclip: Point cloud understanding by clip
R Zhang, Z Guo, W Zhang, K Li, X Miao, B Cui, Y Qiao, P Gao, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
2142022
Llama-adapter v2: Parameter-efficient visual instruction model
P Gao, J Han, R Zhang, Z Lin, S Geng, A Zhou, W Zhang, P Lu, C He, ...
arXiv preprint arXiv:2304.15010, 2023
1952023
End-to-end object detection with adaptive clustering transformer
M Zheng, P Gao, R Zhang, K Li, X Wang, H Li, H Dong
arXiv preprint arXiv:2011.09315, 2020
1912020
Point-m2ae: multi-scale masked autoencoders for hierarchical point cloud pre-training
R Zhang, Z Guo, P Gao, R Fang, B Zhao, D Wang, Y Qiao, H Li
Advances in neural information processing systems 35, 27061-27074, 2022
1222022
Convmae: Masked convolution meets masked autoencoders
P Gao, T Ma, H Li, Z Lin, J Dai, Y Qiao
arXiv preprint arXiv:2205.03892, 2022
116*2022
Frozen clip models are efficient video learners
Z Lin, S Geng, R Zhang, P Gao, G de Melo, X Wang, J Dai, Y Qiao, H Li
European Conference on Computer Vision, 388-404, 2022
1022022
Multi-modality latent interaction network for visual question answering
P Gao, H You, Z Zhang, X Wang, H Li
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
92*2019
Question-guided hybrid convolution for visual question answering
P Gao, H Li, S Li, P Lu, Y Li, SCH Hoi, X Wang
Proceedings of the European Conference on Computer Vision (ECCV), 469-485, 2018
842018
Monodetr: Depth-guided transformer for monocular 3d object detection
R Zhang, H Qiu, T Wang, Z Guo, Z Cui, Y Qiao, H Li, P Gao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
752023
Container: Context aggregation network
P Gao, J Lu, H Li, R Mottaghi, A Kembhavi
arXiv preprint arXiv:2106.01401, 2021
69*2021
Learning where to focus for efficient video object detection
Z Jiang, Y Liu, C Yang, J Liu, P Gao, Q Zhang, S Xiang, C Pan
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
632020
Pointclip v2: Adapting clip for powerful 3d open-world learning
X Zhu, R Zhang, B He, Z Zeng, S Zhang, P Gao
arXiv preprint arXiv:2211.11682, 2022
582022
Learning 3d representations from 2d pre-trained models via image-to-point masked autoencoders
R Zhang, L Wang, Y Qiao, P Gao, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
572023
Prompt, generate, then cache: Cascade of foundation models makes strong few-shot learners
R Zhang, X Hu, B Li, S Huang, H Deng, Y Qiao, P Gao, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
562023
The system can't perform the operation now. Try again later.
Articles 1–20