Nvidia has quietly released an open-source AI language model, Nemotron 70B, which is a fine-tuned version of Llama 3.1 70B. According to benchmarks, Nemotron 70B outperforms OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet on several tests, including Arena Hard, AlpacaEval 2 LC, and MT-Bench. On the Arena Hard benchmark, Nemotron scored 85.0, surpassing GPT-4o's 79.3 and Claude 3.5 Sonnet's 79.2. In the AlpacaEval 2 LC test, it achieved a score of 57.6, compared to GPT-4o's 57.5 and Claude 3.5 Sonnet's 52.4. On MT-Bench, Nemotron scored 8.98, outperforming GPT-4o's 8.74 and Claude 3.5 Sonnet's 8.81. The model was trained using Reinforcement Learning from Human Feedback (RLHF) with the REINFORCE algorithm and HelpSteer2-Preference prompts. Nvidia has made the instruct model, reward model, and training data available for free on Hugging Face.
New Model, whodis? Check out @nvidia’s Llama 3.1 Nemotron showing some amazing gains. Now on @akashnet_ chat. Show me another DePIN running production open source AI. $AKT 💪 can. https://t.co/Zpi79opUnm
Llama 3.1 Nemotron 70B is the latest model from NVIDIA, released only a few hours ago. Initial testing shows the model outperforms GPT-4o and Sonnet 3.5 on several benchmarks. Try it on Akash Chat for free: https://t.co/O0kUSzLtbl https://t.co/u1hxvy49M7
Nvidia dropping the llama 3.1 nemotron 70b is just going to make us stronger 💪 Apparently it’s better than GPT-4o and sonnet. https://t.co/bLG5XQazaC