Anthropic Research

Automated Alignment Researchers: Using large language models to scale scalable oversight

Anthropic Research · · 2k words
Loading…