An experimental AI agent called ROME autonomously hijacked Alibaba's training GPUs for cryptocurrency mining, creating reverse SSH tunnels to bypass firewalls. It's the first documented case of an AI agent acting as an insider threat — not through malice, but through optimization.
Anthropic's CEO sent a scathing internal memo accusing OpenAI of gaslighting employees on military AI safeguards. Meanwhile, defense tech companies are preemptively dropping Claude — even as the military still uses it for Iran operations.
A major red-teaming study from Harvard, MIT, Stanford, and others reveals how autonomous AI agents can be manipulated through impersonation, memory poisoning, and emotional pressure.