Alignment for advanced machine learning systems J Taylor, E Yudkowsky, P LaVictoire, A Critch Ethics of artificial intelligence, 342-382, 2016 | 133 | 2016 |
Quantilizers: A safer alternative to maximizers for limited optimization J Taylor Workshops at the Thirtieth AAAI Conference on Artificial Intelligence, 2016 | 56 | 2016 |
Logical induction S Garrabrant, T Benson-Tilsen, A Critch, N Soares, J Taylor arXiv preprint arXiv:1609.03543, 2016 | 52 | 2016 |
A formal solution to the grain of truth problem J Leike, J Taylor, B Fallenstein arXiv preprint arXiv:1609.05058, 2016 | 15 | 2016 |
Asymptotic convergence in online learning with unbounded delays S Garrabrant, N Soares, J Taylor arXiv preprint arXiv:1604.05280, 2016 | 12 | 2016 |
A formal approach to the problem of logical non-omniscience S Garrabrant, T Benson-Tilsen, A Critch, N Soares, J Taylor arXiv preprint arXiv:1707.08747, 2017 | 11 | 2017 |
Reflective oracles: A foundation for game theory in artificial intelligence B Fallenstein, J Taylor, PF Christiano International Workshop on Logic, Rationality and Interaction, 411-415, 2015 | 11 | 2015 |
Reflective variants of Solomonoff induction and AIXI B Fallenstein, N Soares, J Taylor International Conference on Artificial General Intelligence, 60-69, 2015 | 5 | 2015 |
Alignment for Advanced Machine Learning Systems, Ethics of artificial intelligence J Taylor, E Yudkowsky, P LaVictoire, A Critch Oxford University Press | 5 | |
Reflective oracles: A foundation for classical game theory B Fallenstein, J Taylor, PF Christiano arXiv preprint arXiv:1508.04145, 2015 | 4 | 2015 |
Compressionism: a theory of mind based on data compression P Maguire, O Mulhall, R Maguire, J Taylor Proceedings of the 11th International Conference on Cognitive Science, 294-299, 2015 | 4 | 2015 |
Limit-Computable Grains of Truth for Arbitrary Computable Extensive-Form (Un)Known Games C Wyeth, M Hutter, J Leike, J Taylor | | 2024 |
Identifying and interpreting tuning dimensions in deep networks NS Dey, JE Taylor, BP Tripp, A Wong, GW Taylor arXiv preprint arXiv:2011.03043, 2020 | | 2020 |
Logical Induction (Abridged) S Garrabrant, T Benson-Tilsen, A Critch, N Soares, J Taylor arXiv preprint arXiv:1609.03543, 2016 | | 2016 |
Logical Induction Abridged version, early draft S Garrabrant, T Benson-Tilsen, A Critch, N Soares, J Taylor | | 2016 |