Kshitij Sachan - Google Scholar

Cited by

	All	Since 2019
Citations	30	30
h-index	3	3
i10-index	1	1

	2023	2024
Citations per year	12	17

Co-authors

Buck Shlegeris - CTO, Redwood Research - Verified email at rdwrs.com
Adam Scherlis - Verified email at scherlis.com
Joe Benton - Anthropic - Verified email at anthropic.com
Adam Jermyn - Flatiron Research Fellow - Verified email at flatironinstitute.org

Kshitij Sachan

Member of Technical Staff, Redwood Research

Verified email at rdwrs.com


Title	Cited by	Year
Polysemanticity and capacity in neural networks. A Scherlis, K Sachan, AS Jermyn, J Benton, B Shlegeris. arXiv preprint arXiv:2210.01892, 2022	13	2022
Sleeper agents: Training deceptive LLMs that persist through safety training. E Hubinger, C Denison, J Mu, M Lambert, M Tong, M MacDiarmid, ... arXiv preprint arXiv:2401.05566, 2024	9	2024
AI control: Improving safety despite intentional subversion. R Greenblatt, B Shlegeris, K Sachan, F Roger. arXiv preprint arXiv:2312.06942, 2023	5	2023
Debating with More Persuasive LLMs Leads to More Truthful Answers. A Khan, J Hughes, D Valentine, L Ruis, K Sachan, A Radhakrishnan, ... arXiv preprint arXiv:2402.06782, 2024	3	2024
