Hailey Schoelkopf
Researcher, EleutherAI
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
Pythia: A suite for analyzing large language models across training and scaling
S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ...
International Conference on Machine Learning, 2397-2430, 2023
Crosslingual generalization through multitask finetuning
N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ...
arXiv preprint arXiv:2211.01786, 2022
Starcoder: may the source be with you!
R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ...
arXiv preprint arXiv:2305.06161, 2023
SantaCoder: don't reach for the stars!
LB Allal, R Li, D Kocetkov, C Mou, C Akiki, CM Ferrandis, N Muennighoff, ...
arXiv preprint arXiv:2301.03988, 2023
Llemma: An open language model for mathematics
Z Azerbayev, H Schoelkopf, K Paster, MD Santos, S McAleer, AQ Jiang, ...
arXiv preprint arXiv:2310.10631, 2023
Emergent and predictable memorization in large language models
S Biderman, U Prashanth, L Sutawika, H Schoelkopf, Q Anthony, ...
Advances in Neural Information Processing Systems 36, 2024
Folio: Natural language reasoning with first-order logic
S Han, H Schoelkopf, Y Zhao, Z Qi, M Riddell, L Benson, L Sun, E Zubova, ...
arXiv preprint arXiv:2209.00840, 2022
Bloom+ 1: Adding language support to bloom for zero-shot prompting
ZX Yong, H Schoelkopf, N Muennighoff, AF Aji, DI Adelani, K Almubarak, ...
arXiv preprint arXiv:2212.09535, 2022
A framework for few-shot language model evaluation, 12 2023
L Gao, J Tow, B Abbasi, S Biderman, S Black, A DiPofi, C Foster, ...
URL https://zenodo.org/records/10256836
Proofnet: Autoformalizing and formally proving undergraduate-level mathematics
Z Azerbayev, B Piotrowski, H Schoelkopf, EW Ayers, D Radev, J Avigad
arXiv preprint arXiv:2302.12433, 2023
Social choice for AI alignment: Dealing with diverse human feedback
V Conitzer, R Freedman, J Heitzig, WH Holliday, BM Jacobs, N Lambert, ...
arXiv preprint arXiv:2404.10271, 2024
GAIA search: Hugging face and pyserini interoperability for nlp training data exploration
A Piktus, O Ogundepo, C Akiki, A Oladipo, X Zhang, H Schoelkopf, ...
arXiv preprint arXiv:2306.01481, 2023
Explicit Knowledge Transfer for Weakly-Supervised Code Generation
Z Azerbayev, A Ni, H Schoelkopf, D Radev
arXiv preprint arXiv:2211.16740, 2022
Suppressing Pink Elephants with Direct Principle Feedback
L Castricato, N Lile, S Anand, H Schoelkopf, S Verma, S Biderman
arXiv preprint arXiv:2402.07896, 2024
Lessons from the Trenches on Reproducible Evaluation of Language Models
S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ...
arXiv preprint arXiv:2405.14782, 2024
Transformer Math 101
Q Anthony, S Biderman, H Schoelkopf, 2023
Attributing Mode Collapse in the Fine-Tuning of Large Language Models
L O’Mahony, L Grinsztajn, H Schoelkopf, S Biderman