Anthropic's Autonomous Alignment Agents Outperform Human Researchers: What This Means for AI Safety Progress
Anthropic has demonstrated AI agents that conduct alignment research autonomously and outperform human researchers, a result that could change how bottlenecks in safety research are addressed.