gpt-5 plays Pokémon — 3x faster progress than o3: https://t.co/JUxIK8CB1y
GPT-5 just completed Pokémon Red in about 7 days, damn... impressive. It took only 6,470 steps, far fewer than o3, which took 18,184. Another key point: o3 took 15 days to finish, while GPT-5 did it in less than half that time. It also beat Claude and Gemini by a big margin. https://t.co/QE8Exg0aiK https://t.co/qizJLmbmvu
In terms of efficiency, GPT-5 completed Pokémon Red with only about a third of o3's steps (roughly two-thirds fewer). In this simple and abstract example, that works out to a step-efficiency gain of nearly 200%. https://t.co/3XKmZ8xISP https://t.co/NbC95NBCWE
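As a rough sanity check on those figures, here is a minimal Python sketch using only the step counts quoted above; the exact definition of "efficiency gain" (steps saved relative to GPT-5's count) is an assumption for illustration, not how the run itself reports it.

    # Back-of-the-envelope check of the step-efficiency figures quoted above.
    gpt5_steps = 6_470    # GPT-5's reported decision steps
    o3_steps = 18_184     # o3's reported decision steps

    step_fraction = gpt5_steps / o3_steps        # ~0.36: GPT-5 used about a third of o3's steps
    steps_saved = 1 - step_fraction              # ~0.64: roughly two-thirds fewer steps
    efficiency_gain = o3_steps / gpt5_steps - 1  # ~1.81: ~180%, in the ballpark of the quoted ~200%

    print(f"GPT-5 used {step_fraction:.0%} of o3's steps ({steps_saved:.0%} fewer)")
    print(f"Approximate step-efficiency gain: {efficiency_gain:.0%}")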
OpenAI’s experimental GPT-5 agent has completed the 1996 video game “Pokémon Red” in 6,470 in-game decision steps, according to developers involved in the test. The run required roughly one week of continuous play, less than half the 15 days logged by the company’s earlier o3 model. The result marks a nearly three-fold reduction in the number of steps compared with o3’s 18,184-step play-through, translating to an estimated gain of close to 200 percent in step efficiency. Observers said GPT-5 also outperformed rival large language models, including Anthropic’s Claude and Google’s Gemini, although detailed figures for those systems were not disclosed. While beating a vintage role-playing game is far removed from commercial applications, researchers view the exercise as a proxy for an AI system’s ability to plan, adapt and optimise over long sequences of actions, skills that could translate to robotics, code generation and complex decision-making tasks.