Anthropic Research
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Anthropic Research
·
April 12, 2022
·
238 words
Light
Light
Sepia
Dark
0/0
←
→
×
Download
Original
Loading…
‹
›