DeepSeek-V2-0628 Released: An Improved Open-Source Version of DeepSeek-V2 Read our take on this: https://t.co/Ie4AgML86b Model Card: https://t.co/vyH0A2oo7s API Access: https://t.co/tl05k9VJDw DeepSeek-V2-Chat-0628 is an enhanced iteration of the previous DeepSeek-V2-Chat… https://t.co/ATNhyPUtpb
Human specialists excel through education & experience. This applies to LLMs: post-training of free/open models can produce top performers on various benchmarks. Athene-70B, a fine-tuned version of Llama-3-70B, is ascending to the top of Arena-Hard. #OpenAlwaysWins https://t.co/kLAxgGRGO0
📢 Excited to release Athene-Llama3-70B chat LLM, delivering new record on Arena Hard from @lmsys Chatbot Arena! 🔥For the first time, open-weight models really breathe down the neck of Claude-3.5 and GPT-4o on Arena Hard. 🛠️Athene-70B comes from @NexusflowX targeted… https://t.co/Gjq5J4Up6u

Recent advancements in open-weight chat models have been announced, showcasing significant improvements in performance and capabilities. DeepSeek-V2-Chat-0628 has achieved Rank 1 in the Open Weight Model category in Chatbot Arena and is ranked #11 overall, outperforming other open-source models. It also holds the #3 position in both the Coding Arena and Hard Prompts Arena. Additionally, Athene-Llama3-70B, a fine-tuned version of Llama-3-70B from Meta AI, has been released by NexusflowX. This model has set a new record on Arena-Hard with a score of 77.8%, approaching the performance of top proprietary models like Claude-3.5 and GPT-4o. ELO ratings for these models are expected soon. These developments highlight the potential of post-training in enhancing the capabilities of open-source models, making them competitive with leading proprietary solutions. API Access and Model Card for DeepSeek-V2-Chat-0628 are also available.


