Follow
Kevin A. Wang
Kevin A. Wang
Brown University
Verified email at kevinwang.us
Title
Cited by
Cited by
Year
A framework for few-shot language model evaluation
L Gao, J Tow, S Biderman, S Black, A DiPofi, C Foster, L Golding, J Hsu, ...
https://zenodo.org/doi/10.5281/zenodo.5371628, 8, 2021
472*2021
XDO: A double oracle algorithm for extensive-form games
S McAleer, JB Lanier, KA Wang, P Baldi, R Fox
Advances in Neural Information Processing Systems 34, 23128-23139, 2021
542021
Anytime optimal psro for two-player zero-sum games
S McAleer, K Wang, M Lanctot, J Lanier, P Baldi, R Fox
arXiv preprint arXiv:2201.07700 3, 2022
21*2022
Self-play psro: Toward optimal populations in two-player zero-sum games
S McAleer, JB Lanier, K Wang, P Baldi, R Fox, T Sandholm
arXiv preprint arXiv:2207.06541, 2022
122022
Lessons from the Trenches on Reproducible Evaluation of Language Models
S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ...
arXiv preprint arXiv:2405.14782, 2024
82024
The second NeurIPS tournament of reconnaissance blind chess
G Perrotta, RW Gardner, C Lowman, M Taufeeque, N Tongia, ...
NeurIPS 2021 Competitions and Demonstrations Track, 53-65, 2022
62022
The machine reconnaissance blind chess tournament of NeurIPS 2022
RW Gardner, G Perrotta, A Shah, S Kalyanakrishnan, KA Wang, G Clark, ...
NeurIPS 2022 Competition Track, 119-132, 2023
32023
The Update-Equivalence Framework for Decision-Time Planning
S Sokota, G Farina, DJ Wu, H Hu, KA Wang, JZ Kolter, N Brown
arXiv preprint arXiv:2304.13138, 2023
32023
Bayesian Opponent Modeling in Multiplayer Imperfect-Information Games
S Ganzfried, KA Wang, M Chiswick
arXiv preprint arXiv:2212.06027, 2022
12022
Time is of the Essence: Why Decision-Time Planning Costs Matter
KA Wang, J Xia, S Chung, J Wang, FP Velez, HJ Wang, A Greenwald
Finding the Frame: An RLC Workshop for Examining Conceptual Frameworks, 0
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games
SM McAleer, JB Lanier, KA Wang, P Baldi, T Sandholm, R Fox
The Twelfth International Conference on Learning Representations, 0
The system can't perform the operation now. Try again later.
Articles 1–11