Aug 7, 05:35 PM

xAI’s Grok 4 Heavy Tops OpenAI GPT-5 in Key AI Benchmark

xAI’s flagship large language model, Grok 4 Heavy, has edged out OpenAI’s next-generation GPT-5 in a recently disclosed run of the independent “Humanity’s Last Exam” benchmark, according to data circulating among AI researchers on 7 Aug. Grok 4 Heavy recorded a 44.4% result, compared with GPT-5’s 42.0%, indicating a modest performance lead for the Musk-backed startup in one of the industry’s widely watched stress-tests of reasoning and general knowledge. The result is notable because GPT-5 represents OpenAI’s first major model upgrade since the GPT-4 series, and comes as the Microsoft-backed company strives to maintain its technological edge amid intensifying competition. xAI, founded in 2023, has been positioning Grok as a direct rival to OpenAI models while integrating the system across the X social-media platform and other services. While benchmark scores do not always translate into real-world application quality, the latest figures add to pressure on incumbents as a growing field of challengers demonstrates rapid gains in model capability. Neither company immediately commented on the benchmark comparison.

#OpenAI #Musk #Microsoft #xAI #Grok

Written with ChatGPT .

Sources

Additional media

Image #1 for story xais-grok-4-heavy-tops-openai-gpt-5-key-ai-benchmark-7c088873

Image #2 for story xais-grok-4-heavy-tops-openai-gpt-5-key-ai-benchmark-7c088873

Image #3 for story xais-grok-4-heavy-tops-openai-gpt-5-key-ai-benchmark-7c088873

Image #4 for story xais-grok-4-heavy-tops-openai-gpt-5-key-ai-benchmark-7c088873

Image #5 for story xais-grok-4-heavy-tops-openai-gpt-5-key-ai-benchmark-7c088873

Image #6 for story xais-grok-4-heavy-tops-openai-gpt-5-key-ai-benchmark-7c088873

xAI’s Grok 4 Heavy Tops OpenAI GPT-5 in Key AI Benchmark

Sources

Additional media

Similar Stories

Similar Stories