Anthropic Research
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
Anthropic Research
·
January 14, 2024
·
242 words
Light
Light
Sepia
Dark
0/0
←
→
×
Download
Original
Loading…
‹
›