Alibaba has released Qwen3, a new family of open-source large language models (LLMs) that includes both dense and Mixture-of-Experts (MoE) architectures. The dense models range from 0.6 billion to 32 billion parameters and are designed for advanced reasoning, coding, instruction following, and multilingual tasks. Qwen3 delivers efficient reasoning at low latency: the Qwen3-32B model running on Cerebras hardware reportedly achieves reasoning latency of about 1.2 seconds while generating over 2,400 tokens per second. The Qwen3 models have quickly gained traction, with more than 100,000 derivative models and integrations into platforms such as Clarifai and Hugging Face. The release marks a notable advancement in open-source AI, positioning Qwen3 as a competitive alternative to existing models like GPT-4o and Claude.
Cerebras Systems blazes a trail for AI inference, powering advanced reasoning in real time https://t.co/fP23BR94tI
Cerebras Launches Qwen3-32B: Real-Time Reasoning with One of the World’s Most Powerful Open Models https://t.co/dAlqaCPj0q
You can now run Qwen3-32B on @HuggingFace with Cerebras Inference — and it’s ⚡️! Typing the question took longer than getting the answer 😅 https://t.co/MoJlHV5rDA
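For readers who want to try this themselves, the sketch below shows one way to query a hosted Qwen3-32B model over an OpenAI-compatible chat-completions endpoint, which is the interface Cerebras Inference exposes. The endpoint URL and the `qwen-3-32b` model id are assumptions for illustration; check the provider's documentation for the exact values.

```python
import json
import urllib.request

# Assumed endpoint and model id for Cerebras Inference; verify against the
# provider's docs before use.
API_URL = "https://api.cerebras.ai/v1/chat/completions"
MODEL = "qwen-3-32b"

def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Assemble a standard OpenAI-style chat-completions request."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )

def ask(prompt: str, api_key: str) -> str:
    """Send the prompt and return the model's reply text."""
    with urllib.request.urlopen(build_request(prompt, api_key)) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

Because the wire format matches the OpenAI chat-completions schema, the same code should work against other Qwen3 hosts by changing only `API_URL` and `MODEL`.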