Nelson Elhage

Researcher at Anthropic focused on AI alignment and interpretability of language models.

Papers in this wiki

A General Language Assistant as a Laboratory for Alignment (2021)