OpenAI remains by far and away the market leader when it comes to LLM usage, but other companies' LLMs are beginning to reach parity when it comes to performance. https://t.co/tDub9BTtMh AI Agenda by @steph_palazzolo
Remember when @Microsoft released Phi-3 models... 🤔 Yup, the ones that had 🦙Llama 3 8B beat on MMLU using 3.8B parameters! 🏆 Now they are on the LMSYS Chatbot Arena Leaderboard! 📊📈 Medium(14B) ranks near GPT-3.5-Turbo-0613, but behind Llama 3 8B. 📉 Phi-3 Small(7B) is… https://t.co/TZg1VlsF0z
Llama-3 currently holds the top position among open-source large language models (LLMs). ✨ 📌 On the Chatbot Arena Leaderboard, it leads the open-source category by a significant margin, with no comparable rivals. The performance gap between Llama-3 and GPT-4 is surprisingly… https://t.co/L2obJmXyw2

The Phi-3 Medium (14B) and Small (7B) models from Microsoft have been added to the LMSYS Chatbot Arena Leaderboard. The Medium model ranks near GPT-3.5-Turbo-0613 but is behind Llama 3 8B. The Small model is close to Llama-2-70B and Mistral fine-tunes. Notably, Phi-3 models had beaten Llama 3 8B on MMLU using 3.8B parameters. Meanwhile, Llama-3 leads the open-source large language models (LLMs) category on the Chatbot Arena Leaderboard by a significant margin. Additionally, the MMLU-Pro benchmark has been introduced to measure expert-level intelligence, with GPT-4o, Gemini-1.5-Pro, and Claude-3-Opus currently holding the top three positions.


