Shalabh Bhatnagar

Cited by

	All	Since 2019
Citations	7256	3853
h-index	34	25
i10-index	90	51

1100

550

275

825

200320042005200620072008200920102011201220132014201520162017201820192020202120222023202427 30 59 71 64 62 87 133 231 232 255 281 280 294 309 423 520 524 732 753 1085 236

Public access

View all

33 articles

10 articles

available

not available

Based on funding mandates

Shalabh Bhatnagar

Professor in the Department of Computer Science and Automation, Indian Institute of Science

Verified email at iisc.ac.in - Homepage

Stochastic systems control simulation optimization


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Natural Actor Critic Algorithms S Bhatnagar, R Sutton, M Ghavamzadeh, Mohammed and Lee Automatica 45 (11), 2471-2482, 2009	907	2009
Fast gradient-descent methods for temporal-difference learning with linear function approximation RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ... Proceedings of the 26th annual international conference on machine learning …, 2009	696	2009
Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods HLPLAP S.Bhatnagar Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation …, 2013	443*	2013
Reinforcement learning with function approximation for traffic signal control LA Prashanth, S Bhatnagar IEEE Transactions on Intelligent Transportation Systems 12 (2), 412-421, 2010	372	2010
Toward off-policy learning control with function approximation. HR Maei, C Szepesvári, S Bhatnagar, RS Sutton ICML 10, 719-726, 2010	331	2010
Convergent temporal-difference learning with arbitrary smooth function approximation H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton Advances in neural information processing systems 22, 2009	329	2009
An online actor–critic algorithm with function approximation for constrained markov decision processes S Bhatnagar, K Lakshmanan Journal of Optimization Theory and Applications 153, 688-708, 2012	317	2012
An actor–critic algorithm with function approximation for discounted cost constrained Markov decision processes S Bhatnagar Systems & Control Letters 59 (12), 760-766, 2010	262	2010
Incremental natural actor-critic algorithms S Bhatnagar, M Ghavamzadeh, M Lee, RS Sutton Advances in neural information processing systems 20, 2007	241	2007
Memory-based deep reinforcement learning for obstacle avoidance in UAV with limited environment knowledge A Singla, S Padakandla, S Bhatnagar IEEE transactions on intelligent transportation systems 22 (1), 107-118, 2019	202	2019
Reinforcement learning algorithm for non-stationary environments S Padakandla, P KJ, S Bhatnagar Applied Intelligence 50 (11), 3590-3606, 2020	128	2020
Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences S Bhatnagar, MC Fu, SI Marcus, IJ Wang ACM Transactions on Modeling and Computer Simulation (TOMACS) 13 (2), 180-209, 2003	115	2003
Multi-agent reinforcement learning for traffic signal control KJ Prabuchandran, HK AN, S Bhatnagar 17th International IEEE Conference on Intelligent Transportation Systems …, 2014	111	2014
A time aggregation approach to Markov decision processes XR Cao, Z Ren, S Bhatnagar, M Fu, S Marcus Automatica 38 (6), 929-943, 2002	89	2002
Reinforcement learning with average cost for adaptive control of traffic lights at intersections LA Prashanth, S Bhatnagar 2011 14th International IEEE Conference on Intelligent Transportation …, 2011	84	2011
Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization S Bhatnagar ACM Transactions on Modeling and Computer Simulation (TOMACS) 15 (1), 74-107, 2005	78	2005
Two-timescale algorithms for learning Nash equilibria in general-sum stochastic games HL Prasad, P LA, S Bhatnagar Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	68	2015
Two time-scale stochastic approximation with controlled Markov noise and off-policy temporal-difference learning P Karmakar, S Bhatnagar Mathematics of Operations Research 43 (1), 130-151, 2018	67	2018
Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization S Bhatnagar ACM Transactions on Modeling and Computer Simulation (TOMACS) 18 (1), 1-35, 2007	67	2007
Two-timescale algorithms for simulation optimization of hidden Markov models S Bhatnagar, MC Fu, SI Marcus, S Bhatnagar Iie Transactions 33 (3), 245-258, 2001	59	2001

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by