
Kyutai Labs has released Helium-1 Preview, a 2-billion-parameter multilingual large language model (LLM) designed for edge and mobile devices. Released under a CC-BY-4.0 license, the model was trained on 2.5 trillion tokens and has a context size of 4,096 tokens. It is reported to match or outperform comparable models such as Qwen 2.5 1.5B, Gemma 2B, and Llama 3B. Helium-1 Preview is now available to developers on the Hugging Face platform, and Kyutai plans to release stronger text models and multimodal capabilities in the future.
.@kyutai_labs just dropped Helium-1 Preview on @huggingface: 2B Base LLM for mobile / edge. Multi-lingual: 🇬🇧🇫🇷🇩🇪🇪🇸🇮🇹🇵🇹. CC-BY-4.0 license. Available today in transformers (pull main). Helium-1 Preview shows strong capabilities in multi-modal, and a full model is in the… https://t.co/DQzgZGREi4
We just released the Helium-1 model, a 2B multi-lingual LLM which @EXGRV and @lmazare have been crafting for us! Best model so far under 2.17B params on multi-lingual benchmarks 🇬🇧🇮🇹🇪🇸🇵🇹🇫🇷🇩🇪 On HF, under CC-BY licence: https://t.co/zpEo6dmJXV https://t.co/9GtZdzwzqV https://t.co/p5ZyZsICFO
Very proud to see our first standalone text LLM released. In our multimodal roadmap, we plan on releasing stronger text models along with audio, vision and more! https://t.co/Ez2JZlWNOz
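For readers who want to try the release, the posts above say the model is usable from the transformers library (built from its main branch at the time). The following is a minimal sketch of what loading it might look like, not an official recipe. The repository id `kyutai/helium-1-preview-2b` is an assumption and should be checked against Kyutai's Hugging Face page. The helper names `clamp_new_tokens` and `complete` are hypothetical and exist only for illustration. Because this is a base model, the sketch uses plain text completion rather than a chat template.

```python
# Assumed repository id; verify the exact name on Kyutai's Hugging Face page.
MODEL_ID = "kyutai/helium-1-preview-2b"
# Context size stated in the announcement.
MAX_CONTEXT_TOKENS = 4096


def clamp_new_tokens(prompt_tokens: int, requested: int,
                     context: int = MAX_CONTEXT_TOKENS) -> int:
    """Limit generation so that prompt plus output fits in the context window."""
    if prompt_tokens >= context:
        raise ValueError("prompt already fills the context window")
    return min(requested, context - prompt_tokens)


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Hypothetical helper: continue `prompt` with the base model."""
    # Imported lazily so clamp_new_tokens works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(prompt, return_tensors="pt")
    prompt_len = inputs["input_ids"].shape[1]
    budget = clamp_new_tokens(prompt_len, max_new_tokens)

    output_ids = model.generate(**inputs, max_new_tokens=budget)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("The capital of France is")` would download the weights on first use, so it needs network access and enough memory for a 2B-parameter model.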