

Google has launched Gemma, a new family of open AI models, and has worked with NVIDIA to optimize them. The partnership uses NVIDIA's TensorRT-LLM to speed up inference on RTX GPUs, and Chat with RTX is expected to add Gemma support. Gemma is already supported by the popular no-code fine-tuning tool 🤗 AutoTrain. The launch also includes integration with Hugging Chat, a fine-tuning script, support in transformers, GGUF files, a free Google Colab example, Flash Attention 2 support, and JAX weights. This release marks Google's entry into the open Large Language Model (LLM) space, with tools and integrations for the community to build upon.
Here are today's AI headlines: [1] Google Unveils Gemma: A New Era of Open Large Language Models with Commercial and Research Applications [2] Google Workspace Introduces Gemini [3] Nokia and Nvidia Forge Alliance to Revolutionize Mobile Networks with Artificial Intelligence…
Take a look at today's Daily Brief for the latest sci-tech news, such as: 🤝 Intel strikes major chip deal with Microsoft 🤖 Google shares ‘open’ AI models 📈 Nvidia gets a revenue boost ...and more! https://t.co/PIiaUPbFDl
Gemma is an amazing step from Google. Looking forward to seeing what the community builds with it! https://t.co/GjredDcKNN https://t.co/jxqf2JFWxW