Follow
Lewis Hammond
Title
Cited by
Cited by
Year
Foundational challenges in assuring alignment and safety of large language models
U Anwar, A Saparov, J Rando, D Paleka, M Turpin, P Hase, ES Lubana, ...
arXiv preprint arXiv:2404.09932, 2024
99*2024
Multi-Agent Reinforcement Learning with Temporal Logic Specifications
L Hammond, A Abate, J Gutierrez, M Wooldridge
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
512021
Rational verification: game-theoretic verification of multi-agent systems
A Abate, J Gutierrez, L Hammond, P Harrenstein, M Kwiatkowska, M Najib, ...
Applied Intelligence 51 (9), 6569-6584, 2021
292021
Lexicographic Multi-Objective Reinforcement Learning
J Skalse, L Hammond, C Griffin, A Abate
Proceedings of the 31st International Joint Conference on Artificial …, 2022
272022
Welfare Diplomacy: Benchmarking Language Model Cooperation
G Mukobi, H Erlebach, N Lauffer, L Hammond, A Chan, J Clifton
arXiv preprint arXiv:2310.08901, 2023
182023
Reasoning about causality in games
L Hammond, J Fox, T Everitt, R Carey, A Abate, M Wooldridge
Artificial Intelligence 320, 103919, 2023
182023
Visibility into AI Agents
A Chan, C Ezell, M Kaufmann, K Wei, L Hammond, H Bradley, E Bluemke, ...
The 2024 ACM Conference on Fairness, Accountability, and Transparency, 958-973, 2024
152024
Rational verification for probabilistic systems
J Gutierrez, L Hammond, AW Lin, M Najib, M Wooldridge
Proceedings of the 18th International Conference on Principles of Knowledge …, 2021
142021
Open problems in technical ai governance
A Reuel, B Bucknall, S Casper, T Fist, L Soder, O Aarne, L Hammond, ...
arXiv preprint arXiv:2407.14981, 2024
132024
Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice
L Hammond, J Fox, T Everitt, A Abate, M Wooldridge
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
132021
Learning tractable probabilistic models for moral responsibility and blame
L Hammond, V Belle
Data Mining and Knowledge Discovery 35 (2), 621–659, 2021
12*2021
Bounded robustness in reinforcement learning via lexicographic objectives
DJ Ornia, L Romao, L Hammond, M Mazo Jr, A Abate
6th Annual Learning for Dynamics & Control Conference, 954-967, 2024
5*2024
Secret Collusion among AI Agents: Multi-Agent Deception via Steganography
SR Motwani, M Baranchuk, M Strohmeier, V Bolina, P Torr, L Hammond, ...
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
5*2024
On Imperfect Recall in Multi-Agent Influence Diagrams
J Fox, M MacDermott, L Hammond, P Harrenstein, A Abate, M Wooldridge
Proceedings of the 19th Conference on Theoretical Aspects of Rationality and …, 2023
42023
IDs for AI Systems
A Chan, N Kolt, P Wills, U Anwar, CS de Witt, N Rajkumar, L Hammond, ...
arXiv preprint arXiv:2406.12137, 2024
32024
All’s Well That Ends Well: Avoiding Side Effects with Distance-Impact Penalties
C Griffin, J Skalse, L Hammond, A Abate
ML Safety Workshop, 36th Conference on Neural Information Processing Systems …, 2022
22022
Cooperation and Control in Delegation Games
O Sourbut, L Hammond, H Wood
arXiv preprint arXiv:2402.15821, 2024
12024
Melting Pot Contest: Charting the Future of Generalized Cooperative Intelligence
R Trivedi, A Khan, J Clifton, L Hammond, EA Duéñez-Guzmán, ...
The Thirty-eight Conference on Neural Information Processing Systems …, 2024
2024
Defining and Mitigating Collusion in Multi-Agent Systems
J Foxabbott, S Deverett, K Senft, S Dower, L Hammond
Multi-Agent Security Workshop@ NeurIPS'23, 2023
2023
Attributing Blame To Decisions Using Tractable Probabilistic Models
L Hammond
University of Edinburgh, 2018
2018
The system can't perform the operation now. Try again later.
Articles 1–20