Follow
Conghui He
Conghui He
Shanghai AI Laboratory
Verified email at pjlab.org.cn - Homepage
Title
Cited by
Cited by
Year
Llama-adapter v2: Parameter-efficient visual instruction model
P Gao, J Han, R Zhang, Z Lin, S Geng, A Zhou, W Zhang, P Lu, C He, ...
arXiv preprint arXiv:2304.15010, 2023
3642023
Mmbench: Is your multi-modal model an all-around player?
Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ...
arXiv preprint arXiv:2307.06281, 2023
3122023
Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data
W Li, C He, J Fang, J Zheng, H Fu, L Yu
Remote Sensing 11 (4), 403, 2019
2262019
Sharegpt4v: Improving large multi-modal models with better captions
L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin
arXiv preprint arXiv:2311.12793, 2023
1672023
9-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios
H Fu, C He, B Chen, Z Yin, Z Zhang, W Zhang, T Zhang, W Xue, W Liu, ...
Proceedings of the International Conference for High Performance Computing …, 2017
1402017
Persformer: 3d lane detection via perspective transformer and the openlane benchmark
L Chen, C Sima, Y Li, Z Zheng, J Xu, X Geng, H Li, C He, J Shi, Y Qiao, ...
European Conference on Computer Vision, 550-567, 2022
1262022
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition
P Zhang, XDB Wang, Y Cao, C Xu, L Ouyang, Z Zhao, S Ding, S Zhang, ...
arXiv preprint arXiv:2309.15112, 2023
982023
Internvid: A large-scale video-text dataset for multimodal understanding and generation
Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ...
arXiv preprint arXiv:2307.06942, 2023
972023
Influence selection for active learning
Z Liu, H Ding, H Zhong, W Li, J Dai, C He
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
812021
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
792024
Internlm2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
582024
Think twice before driving: Towards scalable decoders for end-to-end autonomous driving
X Jia, P Wu, L Chen, J Xie, C He, J Yan, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
532023
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites
Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...
arXiv preprint arXiv:2404.16821, 2024
512024
Global-scale associations of vegetation phenology with rainfall and temperature at a high spatio-temporal resolution
N Clinton, L Yu, H Fu, C He, P Gong
Remote Sensing 6 (8), 7320-7338, 2014
492014
Semantic segmentation based building extraction method using multi-source gis map datasets and satellite imagery
W Li, C He, J Fang, H Fu
Proceedings of the IEEE conference on computer vision and pattern …, 2018
482018
Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer
H Fu, J Liao, W Xue, L Wang, D Chen, L Gu, J Xu, N Ding, X Wang, C He, ...
SC'16: Proceedings of the International Conference for High Performance …, 2016
412016
Sphinx-x: Scaling data and parameters for a family of multi-modal large language models
P Gao, R Zhang, C Liu, L Qiu, S Huang, W Lin, S Zhao, S Geng, Z Lin, ...
arXiv preprint arXiv:2402.05935, 2024
392024
Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation
Q Huang, X Dong, P Zhang, B Wang, C He, J Wang, D Lin, W Zhang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
372024
V3det: Vast vocabulary visual detection dataset
J Wang, P Zhang, T Chu, Y Cao, Y Zhou, T Wu, B Wang, C He, D Lin
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
362023
Joint semantic-geometric learning for polygonal building segmentation
W Li, W Zhao, H Zhong, C He, D Lin
Proceedings of the AAAI Conference on Artificial Intelligence 35 (3), 1958-1965, 2021
362021
The system can't perform the operation now. Try again later.
Articles 1–20