
The recent launch of Llama-3.1-Storm-8B marks a significant advancement in large language models (LLMs), outperforming competitors such as Meta's LLaMA and Hermes across various benchmarks. This new model, developed by Homebrew Research, features Llama3-s v0.2, which introduces enhanced multimodal capabilities, allowing it to understand both audio and text inputs. The model utilizes an innovative early fusion approach with semantic tokens, processing audio through a WhisperVQ encoder before generating text responses. The Llama3 tokenizer has also been integrated into GPT-2 training, expanding its vocabulary size to 128,001 tokens compared to GPT-2's 50,257 tokens. The advancements in Llama-3.1's capabilities are indicative of the growing trend towards multimodal AI systems, which are expected to offer broader applications in natural language processing and real-time interaction. Companies are also launching new features to support multimodal monitoring across various AI models, enhancing the infrastructure for LLMs and audio models.
llama 3.1 8b beamed from my laptop to my phone at 45 tok/s with voice from elevenlabs voice chat with your own private, self-hosted language model https://t.co/m094PgmSwi
I made my own ELO score for LLMs. Motivation: determine which LLM is best for my daily usage. I use LLMs for both coding and instruction-following tasks. Top 5 rankings: • sonnet 3.5 • gpt-4o • llama 3.1 (405b) • gpt-4 turbo • gemini 1.5 pro I created my ELO score by… https://t.co/XDitoJp8xi
Llama3 Just Got Ears! Llama3-s v0.2: A New Multimodal Checkpoint with Improved Speech Understanding https://t.co/gn6rPl3yfF #Llama3 #SpokenLanguage #AI #Llama3sV02 #NaturalLanguageProcessing #ai #news #llm #ml #research #ainews #innovation #artificialintelligence #machinelear… https://t.co/wLEnmTCOON



