Anthropic Research

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Anthropic Research · April 12, 2022 · 238 words
