Moonshot AI, an Alibaba-backed artificial-intelligence startup, has open-sourced its new large language model, Kimi K2. The company and early users say the system approaches GPT-4-level instruction following while remaining free to download and deploy. Kimi K2 uses a mixture-of-experts architecture with roughly one trillion parameters and handles up to 128,000 tokens of context, enough to process long documents and code bases. The Unsloth developer group has already released a 1.8-bit quantized version that shrinks the original 1.1-terabyte weights to 245 gigabytes, a reduction of roughly 80%, so the model can run on high-end local hardware.

Cloud providers and tool makers moved quickly to integrate the model. Groq reports inference speeds of about 185 tokens per second, while Anycoder, NetMind and OpenRouter have added Kimi K2 endpoints for rapid application development and agent frameworks. The rapid uptake underscores the demand for high-performance, open-source alternatives to proprietary AI systems.
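As a quick sanity check on the figures quoted above, the short Python sketch below recomputes the size reduction and estimates generation time at the reported Groq speed. It assumes decimal units (1 terabyte = 1,000 gigabytes) and a constant throughput. The function names and the 1,000-token response length are illustrative choices, not figures from Moonshot AI, Unsloth or Groq.

```python
def reduction_percent(original_gb: float, compressed_gb: float) -> float:
    """Return the size reduction as a percentage of the original size."""
    return (1 - compressed_gb / original_gb) * 100


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Estimate time to generate num_tokens at a constant throughput."""
    return num_tokens / tokens_per_second


# Figures quoted in the summary; 1.1 TB is treated as 1,100 GB (assumption).
ORIGINAL_GB = 1100.0
QUANTIZED_GB = 245.0
GROQ_TOKENS_PER_SECOND = 185.0

if __name__ == "__main__":
    pct = reduction_percent(ORIGINAL_GB, QUANTIZED_GB)
    secs = generation_seconds(1000, GROQ_TOKENS_PER_SECOND)  # hypothetical 1,000-token reply
    print(f"Size reduction: {pct:.1f}%")
    print(f"Time for 1,000 tokens: {secs:.1f} s")
```

Under these assumptions the reduction works out to just under 78%, which is consistent with the rounded 80% figure, and a 1,000-token response would take a little over five seconds.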
Grok 4 and Kimi K2 competing on top of the Trending models charts https://t.co/r1abQUefoW
POV: You’re building agents. You need long context, precision, and speed. You don’t want a black box. Kimi-K2-Instruct is now live on NetMind. Trillion-param MoE. 128K context. GPT-4-level instruction following. Open source. Infra ready. https://t.co/1CjcTa3jiR https://t.co/6qlulAkOol
With Kimi K2 on @GroqInc, it's time to build a deep multi-agent research clone. I'll be trying to use my newly upgraded Cursor clone app to do it.