Nexusflow has launched Athene-V2, a new open model suite with 72 billion parameters, aimed at competing with OpenAI's GPT-4o across various benchmarks. Athene-V2 reportedly excels in chat, code, and math tasks, surpassing GPT-4o in single function calls by 18%. The model's performance in specific benchmarks includes scores of 84.9 for GPT-4o, 77.9 for Athene-V2, and 69.3 for Llama 3.1 405B in the Arena Hard benchmark, while in the Bigcode-Bench Hard benchmark, Athene-V2 scored 31.4 compared to GPT-4o's 30.8. Additionally, Athene-V2 is designed to address the current trends in AI development by focusing on targeted post-training to enhance model capabilities. The model is now available for use in LocalAI, where users can access its various functionalities, including the Athene-v2-agent and Athene-v2-chat models, which are specifically tailored for agentic and conversational tasks.
Athene-V2: a pair of strongly performing 72B models, trained on top of Qwen 2.5 72B instruct. Tool Use and agentic use cases: lms get athene-v2-agent Chat, math, and coding use cases: lms get athene-v2-chat Or via LM Studio's in app search (ctrl/cmd + shift + M). Requires… https://t.co/1yCwiXJ0g1 https://t.co/uqshzNbQxU
Congrats @NexusflowX on the latest Athene-V2-72B release, matching top models across hard benchmarks! Now it comes the real test—Athene is live in Arena for human evaluation. Come ask tough prompts at lmarena. ai! https://t.co/WGSl4G0tOI
Introducing Athene-V2-Chat! 🎉 A new powerful model on par with GPT-4o across benchmarks, trained through RLHF with Qwen-2.5-72B-Instruct. Try it on LocalAI: local-ai run athene-v2-chat #AI #NLP #Chatbot