A framework for few-shot language model evaluation L Gao, J Tow, S Biderman, S Black, A DiPofi, C Foster, L Golding, J Hsu, ... https://zenodo.org/doi/10.5281/zenodo.5371628, 8, 2021 | 472* | 2021 |
XDO: A double oracle algorithm for extensive-form games S McAleer, JB Lanier, KA Wang, P Baldi, R Fox Advances in Neural Information Processing Systems 34, 23128-23139, 2021 | 54 | 2021 |
Anytime optimal psro for two-player zero-sum games S McAleer, K Wang, M Lanctot, J Lanier, P Baldi, R Fox arXiv preprint arXiv:2201.07700 3, 2022 | 21* | 2022 |
Self-play psro: Toward optimal populations in two-player zero-sum games S McAleer, JB Lanier, K Wang, P Baldi, R Fox, T Sandholm arXiv preprint arXiv:2207.06541, 2022 | 12 | 2022 |
Lessons from the Trenches on Reproducible Evaluation of Language Models S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ... arXiv preprint arXiv:2405.14782, 2024 | 8 | 2024 |
The second NeurIPS tournament of reconnaissance blind chess G Perrotta, RW Gardner, C Lowman, M Taufeeque, N Tongia, ... NeurIPS 2021 Competitions and Demonstrations Track, 53-65, 2022 | 6 | 2022 |
The machine reconnaissance blind chess tournament of NeurIPS 2022 RW Gardner, G Perrotta, A Shah, S Kalyanakrishnan, KA Wang, G Clark, ... NeurIPS 2022 Competition Track, 119-132, 2023 | 3 | 2023 |
The Update-Equivalence Framework for Decision-Time Planning S Sokota, G Farina, DJ Wu, H Hu, KA Wang, JZ Kolter, N Brown arXiv preprint arXiv:2304.13138, 2023 | 3 | 2023 |
Bayesian Opponent Modeling in Multiplayer Imperfect-Information Games S Ganzfried, KA Wang, M Chiswick arXiv preprint arXiv:2212.06027, 2022 | 1 | 2022 |
Time is of the Essence: Why Decision-Time Planning Costs Matter KA Wang, J Xia, S Chung, J Wang, FP Velez, HJ Wang, A Greenwald Finding the Frame: An RLC Workshop for Examining Conceptual Frameworks, 0 | | |
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games SM McAleer, JB Lanier, KA Wang, P Baldi, T Sandholm, R Fox The Twelfth International Conference on Learning Representations, 0 | | |