Why A.I. Safety Controls Are Not Very Effective

Scritto il 14/05/2026

da Cade Metz and Tiffany Hsu

Three years after the debut of ChatGPT, fooling A.I. systems into bad behavior is almost trivial.A.I. systems from companies like OpenAI and Anthropic have guardrails that often prove to be porous.