Carroll L. Wainwright
Researcher at OpenAI. Co-author of the InstructGPT paper, which uses reinforcement learning from human feedback (RLHF) to align language models with user instructions.
Sources in this wiki
- [[2022-ouyang-instructgpt|Training language models to follow instructions with human feedback]]