Mudit Verma
Verified email at google.com
Title
Cited by
Year
Position: LLMs Can’t Plan, But Can Help Planning in LLM-Modulo Frameworks
S Kambhampati, K Valmeekam, L Guan, M Verma, K Stechly, S Bhambri, ...
Forty-first International Conference on Machine Learning, 2024
Cited by 65*, 2024
Symbols as a lingua franca for bridging human-AI chasm for explainable and advisable AI systems
S Kambhampati, S Sreedharan, M Verma, Y Zha, L Guan
Proceedings of the AAAI Conference on Artificial Intelligence 36 (11), 12262 …, 2022
Cited by 50, 2022
Widening the pipeline in human-guided reinforcement learning with explanation and context-aware data augmentation
L Guan, M Verma, SS Guo, R Zhang, S Kambhampati
Advances in Neural Information Processing Systems 34, 21885-21897, 2021
Cited by 40, 2021
Bridging the gap: Providing post-hoc symbolic explanations for sequential decision-making problems with inscrutable representations
S Sreedharan, U Soni, M Verma, S Srivastava, S Kambhampati
arXiv preprint arXiv:2002.01080, 2020
Cited by 33, 2020
Theory of Mind abilities of Large Language Models in Human-Robot Interaction: An Illusion?
M Verma, S Bhambri, S Kambhampati
Companion of the 2024 ACM/IEEE International Conference on Human-Robot …, 2024
Cited by 25, 2024
Explanation augmented feedback in human-in-the-loop reinforcement learning
L Guan*, M Verma*, S Guo, R Zhang, S Kambhampati
arXiv preprint arXiv:2006.14804, 2020
Cited by 21, 2020
Bridging the gap: Providing post-hoc symbolic explanations for sequential decision-making problems with black box simulators
S Sreedharan, U Soni, M Verma, S Srivastava, S Kambhampati
arXiv preprint arXiv:2002.01080, 2020
Cited by 19, 2020
Trust-aware planning: Modeling trust evolution in longitudinal human-robot interaction
Z Zahedi, M Verma, S Sreedharan, S Kambhampati
ICAPS 2021 Workshop on Explainable AI Planning, 2021
Cited by 17, 2021
Fine-grained language identification with multilingual CapsNet model
M Verma, AB Buduru
2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM), 94-102, 2020
Cited by 13, 2020
Trust-aware planning: modeling trust evolution in iterated human-robot interaction
Z Zahedi, M Verma, S Sreedharan, S Kambhampati
Proceedings of the 2023 ACM/IEEE international conference on human-robot …, 2023
Cited by 10, 2023
Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning
A Gundawar, M Verma, L Guan, K Valmeekam, S Bhambri, ...
arXiv preprint arXiv:2405.20625, 2024
Cited by 9, 2024
Symbol guided hindsight priors for reward learning from human preferences
M Verma, K Metcalf
arXiv preprint arXiv:2210.09151, 2022
Cited by 9, 2022
Modeling the interplay between human trust and monitoring
Z Zahedi, S Sreedharan, M Verma, S Kambhampati
2022 17th ACM/IEEE International Conference on Human-Robot Interaction (HRI …, 2022
Cited by 8, 2022
A novel framework for neural architecture search in the hill climbing domain
M Verma, P Sinha, K Goyal, A Verma, S Susan
2019 IEEE Second International Conference on Artificial Intelligence and …, 2019
Cited by 8, 2019
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
M Verma, S Bhambri, S Kambhampati
arXiv preprint arXiv:2302.08738, 2023
Cited by 7, 2023
Making smart homes smarter: optimizing energy consumption with human in the loop
M Verma, S Bhambri, S Gupta, AB Buduru
arXiv preprint arXiv:1912.03298, 2019
Cited by 7, 2019
Synthesizing policies that account for human execution errors caused by state aliasing in Markov decision processes
S Gopalakrishnan, M Verma, S Kambhampati
ICAPS 2021 Workshop on Explainable AI Planning, URL https://openreview.net/pdf, 2021
Cited by 6, 2021
Hindsight PRIORs for Reward Learning from Human Preferences
M Verma, K Metcalf
arXiv preprint arXiv:2404.08828, 2024
Cited by 5, 2024
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion
U Soni, N Thakur, S Sreedharan, L Guan, M Verma, M Marquez, ...
arXiv preprint arXiv:2210.15096, 2022
Cited by 5, 2022
Computing Policies That Account For The Effects Of Human Agent Uncertainty During Execution In Markov Decision Processes
S Gopalakrishnan, M Verma, S Kambhampati
arXiv preprint arXiv:2109.07436, 2021
Cited by 5, 2021