
Alibaba has officially launched Qwen2, an advanced open-source AI model series that includes both base and instruction-tuned models in five sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B parameters. The models were trained on data covering 27 languages beyond English and Chinese and are designed to excel at code and math. Qwen2-72B, the largest model, outperforms Meta's Llama 3 70B and supports a 128K context window. All models except the 72B variants are released under the Apache 2.0 license. Qwen2 has posted impressive results on the Open LLM Leaderboard and MixEval, with Qwen2-72B-Instruct leading several key evaluations, including an MMLU score of 84.32 (comparable to GPT-4o) and a GSM8K score of 85. The models ship with MLX support, can be accessed on Hugging Face, and can be deployed using SkyPilot.
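For models accessed via Hugging Face, the usual path is to let `tokenizer.apply_chat_template` from `transformers` format the conversation. As a minimal illustration of the ChatML-style format Qwen2's instruct models expect, the sketch below spells that formatting out by hand; the `build_prompt` helper and the example messages are ours for illustration, not part of any library.

```python
# Illustrative sketch: ChatML-style prompt construction for Qwen2 instruct models.
# In real use, Hugging Face transformers' tokenizer.apply_chat_template does this.
def build_prompt(messages):
    """Render a list of {'role', 'content'} dicts into a ChatML-style prompt."""
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages]
    parts.append("<|im_start|>assistant\n")  # cue the model to generate its reply
    return "".join(parts)

prompt = build_prompt([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is a 128K context window good for?"},
])
print(prompt)
```

The trailing `<|im_start|>assistant\n` is what signals the model to begin its answer; generation stops when it emits `<|im_end|>`.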
Absolutely incredible to watch Alibaba, a Chinese company, rival GPT-4 with a SOTA open-source model 🤯 Qwen2-72B outperformed other leading open-source SOTA models on 15 benchmarks, including language understanding, language generation, multilingual… https://t.co/tBFnStjvTL https://t.co/waaxujF77h
Qwen2 72B Instruct from @Alibaba_Qwen was released yesterday and is topping leaderboards! 🎉 Try it now on the Together API using Qwen/Qwen2-72B-Instruct or through the Playground at: https://t.co/0wUtP8qaYO 🚀 https://t.co/1DUhEgAr6L
The latest version of MLC LLM now supports the newly released Qwen2! Run it effortlessly on a $100 OrangePi: Qwen2 0.5B reaches 17.5 tok/s and 1.5B reaches 8.9 tok/s, making AI capabilities more accessible than ever. Explore more at MLC LLM https://t.co/FRLb1NxM7X #MLC #LLM #Qwen2 #OrangePi https://t.co/6aI9bZkGAc https://t.co/fOlaFJZmrY
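The SkyPilot deployment mentioned in the summary above can be sketched as a task YAML. This is a hypothetical configuration, not from the Qwen2 release notes: the accelerator choice, the use of vLLM's OpenAI-compatible server, and the 7B model pick are all our assumptions for illustration.

```yaml
# Hypothetical SkyPilot task for serving Qwen2-7B-Instruct with vLLM.
# Accelerator, port, and serving stack are illustrative assumptions.
resources:
  accelerators: A100:1

setup: |
  pip install vllm

run: |
  python -m vllm.entrypoints.openai.api_server \
    --model Qwen/Qwen2-7B-Instruct \
    --port 8000
```

A task like this would be launched with `sky launch`, after which the endpoint speaks the OpenAI chat-completions API.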
