Selected Papers

Scaling Laws For Scalable Oversight. Joshua Engels*, David Baek*, Subhash Kantamneni*, and Max Tegmark. Neurips 2025 (Spotlight). Paper | Code | Twitter

Are Sparse Autoencoders Useful? A Case Study in Sparse Probing. Subhash Kantamneni*, Joshua Engels*, Senthooran Rajamanoharan, Max Tegmark, and Neel Nanda. ICML 2025. Paper | Code | Twitter

Low Rank Adapting Models for Sparse Autoencoders. Mathew Chen*, Joshua Engels*, and Max Tegmark. ICML 2025. Paper | Code | Twitter

Decomposing the Dark Matter of Sparse Autoencoders. Joshua Engels, Logan Smith, and Max Tegmark. TMLR 2025. Paper | Code | Twitter

Efficient Dictionary Learning with Switch Sparse Autoencoders. Anish Mudide, Joshua Engels, Eric J Michaud, Max Tegmark, and Christian Schroeder de Witt. ICLR 2025. Paper | Code | Twitter

Not All Language Model Features Are Linear. Joshua Engels, Eric J. Michaud, Isaac Liao, Wes Gurnee, and Max Tegmark. ICLR 2025. Paper | Code | Twitter | Talk

* indicates equal contribution

Other Projects and Writing

Negative Results on Group SAEs

Spreadsheet of 50 Weird LLM Phenomenon