Nicholas Schiefer

Nicholas Schiefer is a researcher at Anthropic who works on understanding and improving the behavior of large language models, with a focus on bias, safety, and alignment.

Sources in this wiki

  • [[2023-ganguli-moral-self-correction]]

Topics