Marek Petrik
Marek Petrik
Verified email at cs.unh.edu - Homepage
Title
Cited by
Cited by
Year
Finite-Sample Analysis of Proximal Gradient TD Algorithms.
B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik
UAI, 504-513, 2015
1052015
Feature selection using regularization in approximate linear programs for Markov decision processes
M Petrik, G Taylor, R Parr, S Zilberstein
arXiv preprint arXiv:1005.1860, 2010
842010
Safe policy improvement by minimizing robust baseline regret
M Ghavamzadeh, M Petrik, Y Chow
Advances in Neural Information Processing Systems, 2298-2306, 2016
762016
An Analysis of Laplacian Methods for Value Function Approximation in MDPs.
M Petrik
IJCAI, 2574-2579, 2007
702007
Learning parallel portfolios of algorithms
M Petrik, S Zilberstein
Annals of Mathematics and Artificial Intelligence 48 (1-2), 85-106, 2006
532006
A bilinear programming approach for multiagent planning
M Petrik, S Zilberstein
Journal of Artificial Intelligence Research 35, 235-274, 2009
372009
Average-Reward Decentralized Markov Decision Processes.
M Petrik, S Zilberstein
IJCAI, 1997-2002, 2007
372007
Anytime coordination using separable bilinear programs
M Petrik, S Zilberstein
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE 22 (1), 750, 2007
322007
Biasing approximate dynamic programming with a lower discount factor
M Petrik, B Scherrer
Advances in neural information processing systems 21, 1265-1272, 2008
302008
Tight approximations of dynamic risk measures
DA Iancu, M Petrik, D Subramanian
Mathematics of Operations Research 40 (3), 655-682, 2015
282015
Constraint relaxation in approximate linear programs
M Petrik, S Zilberstein
Proceedings of the 26th Annual International Conference on Machine Learning …, 2009
282009
Robust Approximate Bilinear Programming for Value Function Approximation.
M Petrik, S Zilberstein
Journal of Machine Learning Research 12 (10), 2011
262011
A practical method for solving contextual bandit problems using decision trees
AN Elmachtoub, R McNellis, S Oh, M Petrik
arXiv preprint arXiv:1706.04687, 2017
232017
An approximate solution method for large risk-averse Markov decision processes
M Petrik, D Subramanian
arXiv preprint arXiv:1210.4901, 2012
212012
RAAM: The benefits of robustness in approximating aggregated MDPs in reinforcement learning
M Petrik, D Subramanian
Advances in Neural Information Processing Systems, 1979-1987, 2014
192014
Hybrid least-squares algorithms for approximate policy evaluation
J Johns, M Petrik, S Mahadevan
Machine learning 76 (2-3), 243-256, 2009
192009
Agile logistics simulation and optimization for managing disaster responses
F Barahona, M Ettl, M Petrik, PM Rimshnick
2013 Winter Simulations Conference (WSC), 3340-3351, 2013
172013
Learning Heuristic Functions through Approximate Linear Programming.
M Petrik, S Zilberstein
ICAPS, 248-255, 2008
132008
Beyond confidence regions: Tight bayesian ambiguity sets for robust mdps
M Petrik, RH Russel
Advances in Neural Information Processing Systems, 7049-7058, 2019
122019
Policy-conditioned uncertainty sets for robust Markov decision processes
A Tirinzoni, M Petrik, X Chen, B Ziebart
Advances in Neural Information Processing Systems, 8939-8949, 2018
122018
The system can't perform the operation now. Try again later.
Articles 1–20