
Google has announced the public preview of Gemini 1.5 Pro on its Vertex AI platform, marking a significant advancement in generative AI capabilities. The model, which supports up to 1M tokens, is designed to power new features in Code Assist and other applications. Gemini 1.5 Pro, described as Google's most capable generative AI model to date, was launched in February and is capable of processing between 128,000 and 1M tokens. This release is part of Google's broader effort to integrate AI more deeply into its cloud services, as highlighted at the Google Cloud Next event. The model now includes the ability to process audio streams, including speech, and has been enhanced with a new File API for easier file handling. Additionally, Google has introduced the ability to enhance Gemini models with Google Search, allowing businesses to integrate these enhanced models into AI agents. The model's widespread availability in over 180+ countries, except the EU, and its new audio (speech) understanding capabilities are expected to facilitate a wide range of applications, from customer service to data analysis.
Google Cloud Next '24 - Gemini Everywhere Gemini Models and API - Gemini 1.5 Pro now available in 180+ countries (still not in EU), with native audio (speech) understanding and 1 million token context window - CodeGemma, a new fine-tuned version of Gemma for code generation and… https://t.co/J9VHC5DRu7
"We have enhanced Gemini 1.5 Pro with the ability to process audio and video ... For instance, you could search a baseball game recording for instances of someone saying 'It’s outta here' in seconds." $GOOG
🎉 It’s a big day for @Google Gemini. Gemini 1.5 Pro now understands audio, uses unlimited files, acts on your commands, and lets devs build incredible things with JSON mode! It’s all 🆓. Here’s why it’s a big deal 👇 🔈 Gemini can hear Gemini understands audio (up to 9.5… https://t.co/HEdP4BMuza










