# Nicholas Schiefer

Nicholas Schiefer is a researcher at Anthropic working on understanding and improving the behavior of large language models, with a focus on bias, safety, and alignment.

## Sources in this wiki

- [[2023-ganguli-moral-self-correction]]

## Topics

Bias in Language Models, Model Alignment, AI Safety