Selected Papers
Scaling Laws For Scalable Oversight. Joshua Engels*, David Baek*, Subhash Kantamneni*, and Max Tegmark. Neurips 2025 (Spotlight). Paper | Code | Twitter
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing. Subhash Kantamneni*, Joshua Engels*, Senthooran Rajamanoharan, Max Tegmark, and Neel Nanda. ICML 2025. Paper | Code | Twitter
Low Rank Adapting Models for Sparse Autoencoders. Mathew Chen*, Joshua Engels*, and Max Tegmark. ICML 2025. Paper | Code | Twitter
Decomposing the Dark Matter of Sparse Autoencoders. Joshua Engels, Logan Smith, and Max Tegmark. TMLR 2025. Paper | Code | Twitter
Efficient Dictionary Learning with Switch Sparse Autoencoders. Anish Mudide, Joshua Engels, Eric J Michaud, Max Tegmark, and Christian Schroeder de Witt. ICLR 2025. Paper | Code | Twitter
Not All Language Model Features Are Linear. Joshua Engels, Eric J. Michaud, Isaac Liao, Wes Gurnee, and Max Tegmark. ICLR 2025. Paper | Code | Twitter | Talk
* indicates equal contribution