Meta AI has announced Llama 3.2, the latest release in its series of open-source AI models. The new lineup adds multimodal capability, combining text and image processing, and spans 1B, 3B, 11B, and 90B parameters. The vision models target visual reasoning, image captioning, and visual question answering (VQA), while the smaller text-only models are optimized for edge and mobile devices, balancing performance and size. On the inference side, SambaNova Cloud reports independently verified speeds of 2,470 output tokens per second on the 1B model and 1,566 on the 3B model, while Groq claims a world record of over 3,000 output tokens per second on the 1B model. Meta reports that the Llama 3.2 vision models are competitive with Claude 3 Haiku and GPT-4o mini on image-understanding benchmarks. These advances position Llama 3.2 as a competitive option in the AI landscape, particularly for applications requiring efficient on-device computation.
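As a concrete sketch of the VQA use case, here is how a request for the 11B vision model might be assembled against an OpenAI-compatible multimodal chat endpoint. The endpoint URL and model identifier below are assumptions for illustration (providers name them differently), and the payload is only constructed and printed, not sent.

```python
import json

# Hypothetical endpoint and model id, for illustration only;
# check your provider's docs for the real values.
ENDPOINT = "https://example-provider.com/v1/chat/completions"
MODEL_ID = "llama-3.2-11b-vision-instruct"

def build_vqa_request(question: str, image_url: str) -> dict:
    """Assemble an OpenAI-style multimodal chat payload:
    one user turn containing a text part and an image part."""
    return {
        "model": MODEL_ID,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": question},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
        "max_tokens": 256,
    }

payload = build_vqa_request(
    "How many people are in this photo?",
    "https://example.com/photo.jpg",
)
print(json.dumps(payload, indent=2))
```

Sending `payload` to a provider's chat-completions endpoint (with an API key) would return the model's answer to the question about the image.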
Meta introduced Llama 3.2, extending its Llama model family with two new vision-language models and two smaller text-only models designed for edge devices. We also teamed up with @AIatMeta for a course that will show you how to put these models to use ⬇️ https://t.co/2L6zueIhTV
🚨 Meta’s new Llama 3.2 models are live on the Azure AI Model Catalog! Access Llama 3.2 11B Vision Instruct and 90B Vision Instruct via serverless API inferencing and managed compute. Learn more: https://t.co/7dxvWzOExR #AzureAI
Groq has set a world record in LLM inference API speed by serving Llama 3.2 1B at >3k output tokens/s, making it ~25X faster than OpenAI GPT-4o's API and ~110X cheaper. This is a great deal for applications running AI on edge devices or on-device, where compute resources are… https://t.co/JuQ4gxMo10 https://t.co/XJW0oJg1Af