Rohit Girdhar
Research Scientist, Fundamental AI Research (FAIR), Meta
Verified email at fb.com
Title | Cited by | Year
Learning a Predictable and Generative Vector Representation for Objects
R Girdhar, DF Fouhey, M Rodriguez, A Gupta
European Conference on Computer Vision (ECCV), 2016
Cited by: 718 | Year: 2016
Video Action Transformer Network
R Girdhar, J Carreira, C Doersch, A Zisserman
Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Cited by: 636 | Year: 2019
ActionVLAD: Learning spatio-temporal aggregation for action classification
R Girdhar, D Ramanan, A Gupta, J Sivic, B Russell
Conference on Computer Vision and Pattern Recognition (CVPR), 2017
Cited by: 496 | Year: 2017
Attentional pooling for action recognition
R Girdhar, D Ramanan
Advances in Neural Information Processing Systems (NeurIPS), 2017
Cited by: 375 | Year: 2017
Masked-attention mask transformer for universal image segmentation
B Cheng, I Misra, AG Schwing, A Kirillov, R Girdhar
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 355 | Year: 2022
Detect-and-Track: Efficient Pose Estimation in Videos
R Girdhar, G Gkioxari, L Torresani, M Paluri, D Tran
Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Cited by: 239 | Year: 2018
An end-to-end transformer model for 3D object detection
I Misra, R Girdhar, A Joulin
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 227 | Year: 2021
Ego4D: Around the world in 3,000 hours of egocentric video
K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 196 | Year: 2022
Detecting twenty-thousand classes using image-level supervision
X Zhou, R Girdhar, A Joulin, P Krähenbühl, I Misra
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
Cited by: 134 | Year: 2022
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
R Girdhar, D Ramanan
International Conference on Learning Representations (ICLR), 2020
Cited by: 130 | Year: 2020
Self-supervised pretraining of 3D features on any point-cloud
Z Zhang, R Girdhar, A Joulin, I Misra
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 126 | Year: 2021
Anticipative Video Transformer
R Girdhar, K Grauman
IEEE/CVF International Conference on Computer Vision (ICCV), 2021
Cited by: 102 | Year: 2021
Omnivore: A single model for many visual modalities
R Girdhar, M Singh, N Ravi, L van der Maaten, A Joulin, I Misra
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 67 | Year: 2022
Binge Watching: Scaling Affordance Learning from Sitcoms
X Wang, R Girdhar, A Gupta
Conference on Computer Vision and Pattern Recognition (CVPR), 2017
Cited by: 66 | Year: 2017
A better baseline for AVA
R Girdhar, J Carreira, C Doersch, A Zisserman
IEEE/CVF Conference on Computer Vision and Pattern Recognition, ActivityNet …, 2018
Cited by: 64 | Year: 2018
DistInit: Learning Video Representations without a Single Labeled Video
R Girdhar, D Tran, L Torresani, D Ramanan
International Conference on Computer Vision (ICCV), 2019
Cited by: 59 | Year: 2019
Mask2Former for video instance segmentation
B Cheng, A Choudhuri, I Misra, A Kirillov, R Girdhar, AG Schwing
arXiv preprint arXiv:2112.10764, 2021
Cited by: 42 | Year: 2021
3D Spatial Recognition without Spatially Labeled 3D
Z Ren, I Misra, AG Schwing, R Girdhar
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Cited by: 30 | Year: 2021
MetaPix: Few-Shot Video Retargeting
J Lee, D Ramanan, R Girdhar
International Conference on Learning Representations (ICLR), 2020
Cited by: 24 | Year: 2020
Video understanding as machine translation
B Korbar, F Petroni, R Girdhar, L Torresani
arXiv preprint arXiv:2006.07203, 2020
Cited by: 22 | Year: 2020