Nvidia has introduced Nemotron-Nano-9B-v2, a 9-billion-parameter open-source language model that blends Mamba state-space layers with transformer blocks. The company says the hybrid design delivers up to six times the throughput of comparably sized transformer models while fitting on a single Nvidia A10 GPU after pruning an earlier 12-billion-parameter version. On internal tests the multilingual model scored 72.1 percent on AIME25, 97.8 percent on MATH500, 64.0 percent on GPQA and 71.1 percent on LiveCodeBench, outperforming the open-source Qwen3-8B on most reasoning benchmarks and topping the Artificial Analysis open-model leaderboard. Developers can toggle chain-of-thought traces on or off with simple tokens and cap the model’s “thinking budget” to trade accuracy for latency. Nemotron-Nano-9B-v2 is available immediately on Hugging Face and Nvidia’s model catalog under the Nvidia Open Model License, which permits free commercial deployment and derivative works provided users keep safety guardrails and attribution. Nvidia also released about three million vision-language training samples and the broader pre-training corpus to spur community adoption. The launch adds to a string of efficiency-focused AI releases from Nvidia. Earlier in the day the company said more than two million developers now build on its robotics software stack, underscoring demand for compact models that can run on edge devices as well as in the data center.
📝 NVIDIA released a massive 3M samples of vision language model training data for OCR, visual question answering, and captioning, built for enterprise documents. It trains Llama 3.1 Nemotron Nano VL 8B V1, which tops OCRBench V2. available on @huggingface The set targets https://t.co/Tg0zdXy6QO
OpenAI and NVIDIA Propel AI Innovation With New Open Models Optimized for the World’s Largest AI Inference Infrastructure https://t.co/7QA1d0QnNE
CrowdStrike, Uber, Zoom Among Industry Pioneers Building Smarter Agents With NVIDIA Nemotron and Cosmos Reasoning Models for Enterprise and Physical AI Applications https://t.co/uzG97DfXA3