Botao Hao

Cited by

	All	Since 2019
Citations	728	716
h-index	16	16
i10-index	23	21

Citations per year: 2018 (4), 2019 (16), 2020 (40), 2021 (107), 2022 (160), 2023 (220), 2024 (167)

Public access

11 articles available, 0 articles not available (based on funding mandates)

Co-authors

Csaba Szepesvari - DeepMind & University of Alberta - Verified email at cs.ualberta.ca
Tor Lattimore - DeepMind - Verified email at google.com
Zheng Wen - Google DeepMind - Verified email at google.com
Yasin Abbasi Yadkori - Google DeepMind - Verified email at google.com
Mengdi Wang - Center for Statistics & Machine Learning, ECE, Princeton University - Verified email at princeton.edu
Will Wei Sun - Associate Professor, Daniels School of Business, Purdue University - Verified email at purdue.edu
Nevena Lazic - DeepMind - Verified email at google.com
Benjamin Van Roy - Stanford University - Verified email at stanford.edu
Jingfei Zhang - Emory University - Verified email at emory.edu
Anru Zhang - Duke University - Verified email at duke.edu
尚作峰 (Zuofeng Shang) - New Jersey Institute of Technology - Verified email at njit.edu
Yufeng Liu - University of North Carolina at Chapel Hill - Verified email at email.unc.edu

Botao Hao

OpenAI

Verified email at openai.com

reinforcement learning, multi-armed bandits, RLHF


Title	Cited by	Year
Simultaneous clustering and estimation of heterogeneous graphical models. B Hao, WW Sun, Y Liu, G Cheng. Journal of Machine Learning Research 18 (217), 1-58, 2018	75	2018
Adaptive exploration in linear contextual bandit. B Hao, T Lattimore, C Szepesvari. International Conference on Artificial Intelligence and Statistics, 3536-3545, 2020	66	2020
Sparse and low-rank tensor estimation via cubic sketchings. B Hao, AR Zhang, G Cheng. International Conference on Artificial Intelligence and Statistics, 1319-1330, 2020	61	2020
Bootstrapping upper confidence bound. B Hao, Y Abbasi-Yadkori, Z Wen, G Cheng. 33rd Conference on Neural Information Processing Systems, 2019	61	2019
High-dimensional sparse linear bandits. B Hao, T Lattimore, M Wang. 34th Conference on Neural Information Processing Systems, 2020	60	2020
Bootstrapping fitted q-evaluation for off-policy inference. B Hao, X Ji, Y Duan, H Lu, C Szepesvari, M Wang. International Conference on Machine Learning, 4074-4084, 2021	38	2021
Sparse feature selection makes batch reinforcement learning more sample efficient. B Hao, Y Duan, T Lattimore, C Szepesvári, M Wang. International Conference on Machine Learning, 4063-4073, 2021	36	2021
Online sparse reinforcement learning. B Hao, T Lattimore, C Szepesvári, M Wang. International Conference on Artificial Intelligence and Statistics, 316-324, 2021	30	2021
Sparse tensor additive regression. B Hao, B Wang, P Wang, J Zhang, J Yang, WW Sun. Journal of Machine Learning Research 22 (64), 1-43, 2021	28	2021
Adaptive approximate policy iteration. B Hao, N Lazic, Y Abbasi-Yadkori, P Joulani, C Szepesvari. Proceedings of the 24th International Conference on Artificial Intelligence …, 2020	27*	2020
Efficient local planning with linear function approximation. D Yin, B Hao, Y Abbasi-Yadkori, N Lazić, C Szepesvári. International Conference on Algorithmic Learning Theory, 1165-1192, 2022	25	2022
Residual bootstrap exploration for bandit algorithms. CH Wang, Y Yu, B Hao, G Cheng. arXiv preprint arXiv:2002.08436, 2020	20	2020
Information directed sampling for sparse linear bandits. B Hao, T Lattimore, W Deng. Advances in Neural Information Processing Systems 34, 16738-16750, 2021	19	2021
The neural testbed: Evaluating joint predictions. I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ... Advances in Neural Information Processing Systems 35, 12554-12565, 2022	17	2022
Regret Bounds for Information-Directed Reinforcement Learning. B Hao, T Lattimore. Advances in Neural Information Processing Systems, 2022	17	2022
Contextual information-directed sampling. B Hao, T Lattimore, C Qin. International Conference on Machine Learning, 8446-8464, 2022	16	2022
Bootstrapping Statistical Inference for Off-Policy Evaluation. B Hao, X Ji, Y Duan, H Lu, C Szepesvári, M Wang. arXiv preprint arXiv:2102.03607, 2021	16	2021
Interacting Contour Stochastic Gradient Langevin Dynamics. W Deng, S Liang, B Hao, G Lin, F Liang. The Tenth International Conference on Learning Representations, 2022	13	2022
Bandit phase retrieval. T Lattimore, B Hao. Advances in Neural Information Processing Systems 34, 18801-18811, 2021	11	2021
Low-rank tensor bandits. B Hao, J Zhou, Z Wen, WW Sun. arXiv e-prints, arXiv: 2007.15788, 2020	11	2020
