Follow
Xudong Lin
Title
Cited by
Cited by
Year
Deep Adversarial Metric Learning
Y Duan, W Zheng, X Lin, J Lu, J Zhou
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
2592018
All in one: Exploring unified video-language pre-training
J Wang, Y Ge, R Yan, Y Ge, KQ Lin, S Tsutsui, X Lin, G Cai, J Wu, Y Shan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
1952023
Dmc-net: Generating discriminative motion cues for fast compressed video action recognition
Z Shou, X Lin, Y Kalantidis, L Sevilla-Lara, M Rohrbach, SF Chang, Z Yan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
1572019
Deep Variational Metric Learning
X Lin, Y Duan, Q Dong, J Lu, J Zhou
Proceedings of the European Conference on Computer Vision (ECCV), 689-704, 2018
1292018
Clip-event: Connecting text and images with event structures
M Li, R Xu, S Wang, L Zhou, X Lin, C Zhu, M Zeng, H Ji, SF Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1262022
Language models with image descriptors are strong few-shot video-language learners
Z Wang, M Li, R Xu, L Zhou, J Lei, X Lin, S Wang, Z Yang, C Zhu, ...
Advances in Neural Information Processing Systems 35, 8483-8497, 2022
1102022
Object-aware Video-language Pre-training for Retrieval
J Wang, Y Ge, G Cai, R Yan, X Lin, Y Shan, X Qie, MZ Shou
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
792022
VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
X Lin, G Bertasius, J Wang, SF Chang, D Parikh, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
742021
Learning To Recognize Procedural Activities with Distant Supervision
X Lin, F Petroni, G Bertasius, M Rohrbach, SF Chang, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
722022
Context-Gated Convolution
X Lin, L Ma, W Liu, SF Chang
ECCV 2020, 2019
612019
RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System
H Wen, Y Lin, T Lai, X Pan, S Li, X Lin, B Zhou, M Li, H Wang, H Zhang, ...
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
582021
GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning
Y Duan, Z Wang, J Lu, X Lin, J Zhou
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
362018
BLINK: Multimodal Large Language Models Can See but Not Perceive
X Fu, Y Hu, B Li, Y Feng, H Wang, X Lin, D Roth, NA Smith, WC Ma, ...
arXiv preprint arXiv:2404.12390, 2024
342024
Resin-11: Schema-guided event prediction for 11 newsworthy scenarios
X Du, Z Zhang, S Li, P Yu, H Wang, T Lai, X Lin, Z Wang, I Liu, B Zhou, ...
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
342022
Joint Multimedia Event Extraction from Video and Article
B Chen, X Lin, C Thomas, M Li, S Yoshida, L Chum, H Ji, SF Chang
arXiv preprint arXiv:2109.12776, 2021
292021
Supervised masked knowledge distillation for few-shot transformers
H Lin, G Han, J Ma, S Huang, X Lin, SF Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
282023
Towards fast adaptation of pretrained contrastive models for multi-channel video-language retrieval
X Lin, S Tiwari, S Huang, M Li, MZ Shou, H Ji, SF Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
242023
Learning to Decompose Visual Features with Latent Textual Prompts
F Wang, M Li, X Lin, H Lv, AG Schwing, H Ji
arXiv preprint arXiv:2210.04287, 2022
232022
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
R Gangi Reddy, X Rui, M Li, X Lin, H Wen, J Cho, L Huang, M Bansal, ...
arXiv e-prints, arXiv: 2112.10728, 2021
21*2021
Video-Text Pre-training with Learned Regions
R Yan, MZ Shou, Y Ge, AJ Wang, X Lin, G Cai, J Tang
arXiv preprint arXiv:2112.01194, 2021
182021
The system can't perform the operation now. Try again later.
Articles 1–20