Chinese AI company DeepSeek has released an upgraded version of its flagship open-weight model, DeepSeek-R1-0528, which demonstrates notable improvements in reasoning, mathematics, programming, and logic. This model is reported to rival the performance of closed AI models such as OpenAI's o3 and Google's Gemini-2.5 Pro across several benchmarks. An 8-billion parameter distilled variant, DeepSeek-R1-0528-Qwen3-8B, outperforms Alibaba's Qwen3-8B model by one IQ point, achieves a 52 IQ score, and runs efficiently on a single 16GB GPU. There are claims that DeepSeek's latest model may have been trained using data from Google's Gemini AI, raising concerns among developers. Meanwhile, Alibaba has unveiled new open-source AI embedding models, maintaining its global leadership in text-embedding services as recognized by Hugging Face benchmarks. Alibaba's Qwen models are acknowledged by Nvidia CEO Jensen Huang as among the best open-source AI models, contributing positively to U.S. AI development. The release of DeepSeek's and Alibaba's models highlights China's growing leadership in open-source AI, with numerous startups emerging to develop AI agents that automate tasks ranging from app development to travel planning. Additionally, OpenThinker3, a 7-billion parameter model trained solely with supervised fine-tuning and no reinforcement learning, has been introduced, outperforming other open 7B and 8B models in math, code, and science. This surge in Chinese AI innovation follows the popularity of Butterfly Effect's Manus AI agent, which has catalyzed a boom in AI agent development within China despite local internet restrictions.
How Alibaba Helped China Take the Lead From the U.S. in Open-Source AI Nvidia CEO Jensen Huang acknowledges Alibaba's Qwen as being among the best open-source AI models, noting the benefits for the U.S. Discover more: https://t.co/uRg7dt6Iuz #AIGlobalImpact
El nuevo modelo de DeepSeek parece deber mucho a Google: parece que ha sido entrenado con la IA de Gemini https://t.co/2bEyg7FrdT
DeepSeek’s new R1-0528-Qwen3-8B is the fastest 8B model to hit 52 IQ score, beats Qwen3 8B by 1pt, fits on 1 GPU. And its now the highest performing 8B parameter model they have tested. It is a distilled version of the DeepSeek-R1-0528 model. It was created by fine-tuning the https://t.co/uAcejEsUpf https://t.co/QuPALyXb4i