Princeton researchers reveal that AI agent reliability improves at half the rate of accuracy. A 10-step agent workflow at 90% per-step reliability will fail over 6 times daily — and the industry has no good fix yet.
The Anthropic Institute's first major report introduces 'AI Coverage' — measuring not what AI could do, but what it's actually doing. Computer programmers top the list at 75%. The white-collar recession isn't a prediction anymore.
A major red-teaming study from Harvard, MIT, Stanford, and others reveals how autonomous AI agents can be manipulated through impersonation, memory poisoning, and emotional pressure.