Kamilė Lukošiūtė
Researcher at Anthropic working on language model evaluation and behavior analysis.
Sources in this wiki
- Discovering Language Model Behaviors with Model-Written Evaluations — Co-author; works on evaluations for discovering language model behaviors