Rohit Girdhar

Cited by

	All	Since 2019
Citations	7204	6848
h-index	25	25
i10-index	28	28

2800

1400

700

2100

2017201820192020202120222023202469 249 433 549 690 1176 2767 1221

Public access

View all

9 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Ishan MisraResearch Scientist, Facebook AI ResearchVerified email at fb.com
Armand JoulinGoogle DeepMindVerified email at google.com
Deva RamananProfessor, Robotics Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Abhinav GuptaProfessor, Robotics Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Mannat SinghFAIR, Meta AIVerified email at fb.com
Lorenzo TorresaniMeta, Fundamental AI Research (FAIR)Verified email at meta.com
Alexander SchwingUniversity of Illinois at Urbana-ChampaignVerified email at illinois.edu
Andrew ZissermanUniversity of OxfordVerified email at robots.ox.ac.uk
Kristen GraumanProfessor of Computer Science, University of Texas at AustinVerified email at cs.utexas.edu
Alexander KirillovResearch Scientist, Facebook AI Research (FAIR)Verified email at fb.com
Carl DoerschResearch Scientist, DeepMindVerified email at google.com
João CarreiraGoogle DeepMindVerified email at google.com
David FouheyNew York UniversityVerified email at nyu.edu
Du TranGoogleVerified email at google.com
Josef SivicCzech Technical University, CIIRC, ELLIS Unit PragueVerified email at cvut.cz
Bryan RussellResearcher, AdobeVerified email at adobe.com
Bowen ChengUniversity of Illinois at Urbana-ChampaignVerified email at illinois.edu
Philipp KrähenbühlUT AustinVerified email at cs.utexas.edu
Alaaeldin El-NoubyResearch Scientist, AppleVerified email at apple.com
Mikel RodriguezDeepMindVerified email at deepmind.com

Rohit Girdhar

Research Scientist, Fundamental AI Research (FAIR), Meta

Verified email at fb.com - Homepage

Computer Vision Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Masked-attention mask transformer for universal image segmentation B Cheng, I Misra, AG Schwing, A Kirillov, R Girdhar Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	1230	2022
Learning a Predictable and Generative Vector Representation for Objects R Girdhar, DF Fouhey, M Rodriguez, A Gupta European Conference on Computer Vision (ECCV) 2016, 2016	812	2016
Video Action Transformer Network R Girdhar, J Carreira, C Doersch, A Zisserman Conference on Computer Vision and Pattern Recognition (CVPR), 2019, 2019	797	2019
ActionVLAD: Learning spatio-temporal aggregation for action classification R Girdhar, D Ramanan, A Gupta, J Sivic, B Russell Conference on Computer Vision and Pattern Recognition (CVPR), 2017, 2017	557	2017
Ego4d: Around the world in 3,000 hours of egocentric video K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	545	2022
Attentional pooling for action recognition R Girdhar, D Ramanan Advances in Neural Information Processing Systems (NeurIPS), 2017, 2017	406	2017
Detecting twenty-thousand classes using image-level supervision X Zhou, R Girdhar, A Joulin, P Krähenbühl, I Misra European Conference on Computer Vision, 350-368, 2022	396	2022
An end-to-end transformer model for 3d object detection I Misra, R Girdhar, A Joulin Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	390	2021
Imagebind: One embedding space to bind them all R Girdhar, A El-Nouby, Z Liu, M Singh, KV Alwala, A Joulin, I Misra Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	357	2023
Detect-and-Track: Efficient Pose Estimation in Videos R Girdhar, G Gkioxari, L Torresani, M Paluri, D Tran Conference on Computer Vision and Pattern Recognition (CVPR), 2018, 2018	287	2018
Self-supervised pretraining of 3d features on any point-cloud Z Zhang, R Girdhar, A Joulin, I Misra Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	224	2021
Anticipative Video Transformer R Girdhar, K Grauman IEEE/CVF International Conference on Computer Vision (ICCV), 2021	177	2021
Omnivore: A single model for many visual modalities R Girdhar, M Singh, N Ravi, L Van Der Maaten, A Joulin, I Misra Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	161	2022
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning R Girdhar, D Ramanan International Conference on Learning Representations (ICLR), 2020, 2020	160	2020
Cut and learn for unsupervised object detection and instance segmentation X Wang, R Girdhar, SX Yu, I Misra Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	93	2023
Binge Watching: Scaling Affordance Learning from Sitcoms X Wang, R Girdhar, A Gupta Conference on Computer Vision and Pattern Recognition (CVPR), 2017, 2017	84	2017
DistInit: Learning Video Representations without a Single Labeled Video R Girdhar, D Tran, L Torresani, D Ramanan International Conference on Computer Vision (ICCV) 2019, 2019	71	2019
A better baseline for ava R Girdhar, J Carreira, C Doersch, A Zisserman IEEE/CVF Conference on Computer Vision and Pattern Recognition, ActivityNet …, 2018	71	2018
Learning video representations from large language models Y Zhao, I Misra, P Krähenbühl, R Girdhar Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	69	2023
Omnimae: Single model masked pretraining on images and videos R Girdhar, A El-Nouby, M Singh, KV Alwala, A Joulin, I Misra Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	66*	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors