OpenAI's o3-mini-high model has emerged as the leading choice for coding tasks, outperforming competitors such as DeepSeek R1, o1, and Claude 3.5 Sonnet across various benchmarks. On LiveBench, o3-mini-high achieved a coding average of 82.74, well ahead of o1's 69.69, Claude 3.5 Sonnet's 67.13, and DeepSeek R1's 66.74. That performance, combined with its speed and cost-effectiveness—roughly half the price of Claude 3.5 Sonnet and about one-fifteenth the price of o1, while running about 5 times faster than comparable models—is expected to shift coding workloads towards o3-mini-high.
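The cost multiples quoted above depend on list prices and on the input/output token mix of a workload. A minimal sketch of how such ratios are derived—note the per-million-token prices and the 50/50 token split below are assumptions for illustration, not authoritative figures:

```python
# Illustrative cost-ratio calculation. The per-million-token prices and the
# assumed 50/50 input/output token split are placeholders; substitute current
# figures from each provider's pricing page.
PRICES = {                      # (input $/M tokens, output $/M tokens) - assumed
    "o3-mini":           (1.10, 4.40),
    "o1":                (15.00, 60.00),
    "claude-3.5-sonnet": (3.00, 15.00),
}

def blended_price(model: str, output_share: float = 0.5) -> float:
    """Blend input and output prices by an assumed output-token share."""
    inp, out = PRICES[model]
    return (1 - output_share) * inp + output_share * out

def cost_ratio(model: str, baseline: str = "o3-mini") -> float:
    """How many times more expensive `model` is than `baseline`."""
    return blended_price(model) / blended_price(baseline)

for m in ("o1", "claude-3.5-sonnet"):
    print(f"{m}: {cost_ratio(m):.1f}x the cost of {('o3-mini')}")
```

With these assumed prices the blended ratios come out near the multiples quoted above; the exact figure shifts with the output-token share, since reasoning models emit many more output tokens than they consume.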
o3-mini and o3-pro imply the existence of o3-pro-max
Currently, o3-mini-high is the optimal choice for my backend coding, surpassing both DeepSeek R1 and o1-pro in performance.
Ran this eval again with o3 mini high and it sets a clear SOTA of 32/38 https://t.co/9MT00sJpWt https://t.co/mkY4GvL8iV