Microsoft's 100-agent AI vulnerability scanner entered its next phase at Microsoft Build 2026 on June 2, 2026, when the company opened an expanded preview of MDASH — the Microsoft Security multi-model ...
CyberGym benchmark scores over time, showing the rapid improvement in AI vulnerability discovery capabilities. Microsoft’s multi-model MDASH system (top right) tops the leaderboard at 88.4%. (CyberGym ...
Former Google engineer and influential AI researcher François Chollet is co-founding a nonprofit to help develop benchmarks that’ll probe AI for “human-level” intelligence. The nonprofit, the ARC ...