Chris Olah¶ Researcher at Anthropic working on neural network interpretability and AI alignment. Papers in this wiki¶ A General Language Assistant as a Laboratory for Alignment (2021)