Follow
Andrea Zanette
Title
Cited by
Cited by
Year
Tighter problem-dependent regret bounds in reinforcement learning without domain knowledge using value function bounds
A Zanette, E Brunskill
International Conference on Machine Learning, 7304-7312, 2019
3062019
Learning near optimal policies with low inherent bellman error
A Zanette, A Lazaric, M Kochenderfer, E Brunskill
International Conference on Machine Learning, 10978-10989, 2020
2402020
Frequentist regret bounds for randomized least-squares value iteration
A Zanette, D Brandfonbrener, E Brunskill, M Pirotta, A Lazaric
International Conference on Artificial Intelligence and Statistics, 1954-1964, 2020
1502020
Provable benefits of actor-critic methods for offline reinforcement learning
A Zanette, MJ Wainwright, E Brunskill
Advances in neural information processing systems 34, 13626-13640, 2021
1272021
Exponential lower bounds for batch reinforcement learning: Batch rl can be exponentially harder than online rl
A Zanette
International Conference on Machine Learning, 12287-12297, 2021
882021
Provably efficient reward-agnostic navigation with linear value iteration
A Zanette, A Lazaric, MJ Kochenderfer, E Brunskill
Advances in Neural Information Processing Systems 33, 11756-11766, 2020
612020
Cautiously optimistic policy optimization and exploration with linear function approximation
A Zanette, CA Cheng, A Agarwal
Conference on Learning Theory, 4473-4525, 2021
592021
Almost horizon-free structure-aware best policy identification with a generative model
A Zanette, MJ Kochenderfer, E Brunskill
Advances in Neural Information Processing Systems 32, 2019
412019
Limiting extrapolation in linear approximate value iteration
A Zanette, A Lazaric, MJ Kochenderfer, E Brunskill
Advances in Neural Information Processing Systems 32, 2019
392019
Robust super-level set estimation using Gaussian processes
A Zanette, J Zhang, MJ Kochenderfer
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2019
382019
Design of experiments for stochastic contextual linear bandits
A Zanette, K Dong, JN Lee, E Brunskill
Advances in Neural Information Processing Systems 34, 22720-22731, 2021
242021
Problem dependent reinforcement learning bounds which can identify bandit structure in mdps
A Zanette, E Brunskill
International Conference on Machine Learning, 5747-5755, 2018
232018
When is realizability sufficient for off-policy reinforcement learning?
A Zanette
International Conference on Machine Learning, 40637-40668, 2023
162023
Archer: Training language model agents via hierarchical multi-turn rl
Y Zhou, A Zanette, J Pan, S Levine, A Kumar
arXiv preprint arXiv:2402.19446, 2024
122024
Bellman residual orthogonalization for offline reinforcement learning
A Zanette, MJ Wainwright
Advances in Neural Information Processing Systems 35, 3137-3151, 2022
102022
Information directed reinforcement learning
A Zanette, R Sarkar
Tech. Rep., Technical report, Technical report, 2017
72017
Stabilizing q-learning with linear architectures for provable efficient learning
A Zanette, M Wainwright
International Conference on Machine Learning, 25920-25954, 2022
62022
Policy finetuning in reinforcement learning via design of experiments using offline data
R Zhang, A Zanette
Advances in Neural Information Processing Systems 36, 2024
42024
Enriching the finite element method with meshfree particles in structural mechanics
A Zanette, M Ferronato, C Janna
International Journal for Numerical Methods in Engineering 110 (7), 675-700, 2017
22017
Accelerating Best-of-N via Speculative Rejection
R Zhang, M Haider, M Yin, J Qiu, M Wang, P Bartlett, A Zanette
2nd Workshop on Advancing Neural Network Training: Computational Efficiency …, 0
1
The system can't perform the operation now. Try again later.
Articles 1–20