Daniel Paleka

120

2022202320242 112 104

Florian TramèrAssistant Professor of Computer Science, ETH ZurichVerified email at inf.ethz.ch
Nicholas CarliniGoogle DeepMindVerified email at google.com
Javier RandoETH ZurichVerified email at ai.ethz.ch
Lennart HeimCentre for the Governance of AIVerified email at governance.ai
David LindnerETH ZürichVerified email at inf.ethz.ch
Lukas FluriMaster graduate, ETH ZürichVerified email at ethz.ch
Amartya SanyalMax Planck Institute for Intelligent Systems, TuebingenVerified email at tuebingen.mpg.de

Daniel Paleka

Verified email at inf.ethz.ch


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Poisoning Web-Scale Training Datasets is Practical N Carlini, M Jagielski, CA Choquette-Choo, D Paleka, W Pearce, ... arXiv preprint arXiv:2302.10149, 2023	89	2023
Red-Teaming the Stable Diffusion Safety Filter J Rando, D Paleka, D Lindner, L Heim, F Tramèr arXiv preprint arXiv:2210.04610, 2022	75	2022
ARB: Advanced Reasoning Benchmark for Large Language Models T Sawada, D Paleka, A Havrilla, P Tadepalli, P Vidas, A Kranias, JJ Nay, ... arXiv preprint arXiv:2307.13692, 2023	25	2023
Evaluating Superhuman Models with Consistency Checks L Fluri, D Paleka, F Tramèr arXiv preprint arXiv:2306.09983, 2023	15	2023
A law of adversarial risk, interpolation, and label noise D Paleka, A Sanyal arXiv preprint arXiv:2207.03933, 2022	7	2022
Foundational Challenges in Assuring Alignment and Safety of Large Language Models U Anwar, A Saparov, J Rando, D Paleka, M Turpin, P Hase, ES Lubana, ... arXiv preprint arXiv:2404.09932, 2024	4	2024
Stealing Part of a Production Language Model N Carlini, D Paleka, KD Dvijotham, T Steinke, J Hayase, AF Cooper, ... arXiv preprint arXiv:2403.06634, 2024	2	2024
Injectivity of ReLU neural networks at initialization D Paleka ETH Zurich, 2021	1	2021

The system can't perform the operation now. Try again later.

Articles 1–8

Citations per year