Collin Burns
Researcher at UC Berkeley working on AI safety, model interpretability, and the latent knowledge and behavior of large language models. His research focuses on discovering what models know without explicit supervision and on aligning model behavior with human values.