A red-team experiment found an AI agent could autonomously exploit a vulnerability in McKinsey’s internal chatbot platform, exposing millions of conversations before the issue was patched.
Enterprise AI agents are often framed as a model problem. We’re told that the leap from building chatbots to agentic systems depends on better reasoning, larger context windows, and smarter benchmarks ...