DeepNewz, mobile.
People-sourced. AI-powered. Unbiased News.
Aug 6, 07:23 AM
Anthropic Releases Claude Opus 4.1 With 74.5% SWE-Bench Score, Outperforming OpenAI o3 and Gemini 2.5 Pro
AI Modeling
AI Products
AI

Authors
  • 9to5Mac
  • Anthropic
  • Techmeme

Anthropic has released Claude Opus 4.1, an upgrade to its flagship AI model Claude Opus 4 that focuses on agentic tasks, real-world coding, and complex reasoning. The update is available to paid users through Claude Code, the API, Amazon Bedrock, and Google Cloud's Vertex AI at no additional cost, and is reported to deliver a one standard deviation improvement over its predecessor. Claude Opus 4.1 scores 74.5% on the SWE-bench Verified coding benchmark, up from Opus 4's 72.5%, and outperforms competitors such as OpenAI's o3, Gemini 2.5 Pro, and Qwen-3 Coder on coding and agentic tasks. Key strengths include multi-file code refactoring, debugging, analytics, and better context understanding, which yields more accurate and helpful responses. The release marks Anthropic's quickest upgrade cycle, arriving about two months after Opus 4, and further substantial improvements are expected in the coming weeks. The model is also integrated into platforms such as Poe and has been praised for its solid coding capabilities and for the steady stream of enhancements delivered through Claude Code.

Written with ChatGPT (GPT-4).
