Anthropic on 5 August introduced Claude Opus 4.1, a mid-cycle upgrade to its most powerful large language model. The company says the new version delivers sharper performance on agentic tasks, complex reasoning and real-world coding, while keeping the same pricing as Claude Opus 4. In internal testing, Opus 4.1 achieved 74.5% on the SWE-bench Verified software-engineering benchmark, eclipsing both its predecessor and rival models such as OpenAI’s o3 and Google’s Gemini 2.5 Pro. Anthropic also reports higher precision in multi-file code refactoring and a lower refusal rate when responding to user requests. The model is immediately available to paid Claude users and in Claude Code, as well as through Anthropic’s API, Amazon Bedrock and Google Cloud’s Vertex AI. Developers can access it under the identifier “claude-opus-4-1-20250805.” Anthropic said it expects to release “substantially larger improvements” to its model family in the coming weeks, underscoring the escalating competition in generative AI ahead of OpenAI’s anticipated GPT-5 launch.
Anthropic just dropped Claude Opus 4.1, which beats o3 and Gemini 2.5 Pro in coding. Opus 4.1 delivers a one-standard-deviation improvement over Opus 4, roughly the same performance leap as the jump from Sonnet 3.7 to Sonnet 4. Dario won. https://t.co/HmRJvGVBPY
Anthropic releases Claude Opus 4.1 to paid Claude users, in Claude Code, via its API, and more, featuring broad improvements over Opus 4 for the same cost (Anthropic) https://t.co/6Qvlm0oU2f https://t.co/gKLgzGAhQV https://t.co/ZOzeer2dpR
Anthropic rolls out Claude Opus 4.1 with improved software engineering accuracy https://t.co/X95veiXnp8 by @apollozac