Jul 18, 05:56 PM

ARC Unveils ARC-AGI-3 Benchmark, Frontier AI Scores 0%

ARC has released a developer preview of ARC-AGI-3, the latest version of its Artificial General Intelligence benchmark designed to test interactive reasoning. The preview makes three of six planned game-like environments publicly available, together with an API that lets researchers run their own agents against the tasks. Initial results underscore the challenge: leading AI systems scored 0% while human participants achieved 100% across the released levels. ARC said the new suite, a step up from earlier versions that focused on static reasoning, is intended to probe skills such as planning and real-time decision making that conventional deep-learning models struggle with. To spur progress, ARC opened a $10,000 contest for the first agent that can solve the current tasks. A full release of ARC-AGI-3 is scheduled for early 2026, and the group plans to incorporate community feedback gathered during the preview period.

#ARC #Artificial General Intelligence

Written with ChatGPT .

Sources

Additional media

Image #1 for story arc-unveils-arc-agi-3-benchmark-frontier-ai-scores-0-87da4414

Image #2 for story arc-unveils-arc-agi-3-benchmark-frontier-ai-scores-0-87da4414

Image #3 for story arc-unveils-arc-agi-3-benchmark-frontier-ai-scores-0-87da4414

Image #4 for story arc-unveils-arc-agi-3-benchmark-frontier-ai-scores-0-87da4414

Image #5 for story arc-unveils-arc-agi-3-benchmark-frontier-ai-scores-0-87da4414

ARC Unveils ARC-AGI-3 Benchmark, Frontier AI Scores 0%

Sources

Additional media

Similar Stories