
Microsoft's Phi-3 Medium (14B) and Phi-3 Small (7B) models have been added to the LMSYS Chatbot Arena Leaderboard. Medium ranks near GPT-3.5-Turbo-0613 but behind Llama 3 8B, while Small is close to Llama-2-70B and Mistral fine-tunes. This follows the Phi-3 release, in which a 3.8B-parameter model beat Llama 3 8B on MMLU. Llama-3 leads the open-source large language model (LLM) category on the Chatbot Arena Leaderboard by a significant margin. Separately, the MMLU-Pro benchmark has been introduced to measure expert-level intelligence; GPT-4o, Gemini-1.5-Pro, and Claude-3-Opus currently hold its top three positions.
OpenAI remains far and away the market leader when it comes to LLM usage, but other companies' LLMs are beginning to reach parity when it comes to performance. https://t.co/tDub9BTtMh AI Agenda by @steph_palazzolo
Remember when @Microsoft released Phi-3 models... 🤔 Yup, the ones that had 🦙Llama 3 8B beat on MMLU using 3.8B parameters! 🏆 Now they are on the LMSYS Chatbot Arena Leaderboard! 📊📈 Medium(14B) ranks near GPT-3.5-Turbo-0613, but behind Llama 3 8B. 📉 Phi-3 Small(7B) is… https://t.co/TZg1VlsF0z
Llama-3 currently holds the top position among open-source large language models (LLMs). ✨ 📌 On the Chatbot Arena Leaderboard, it leads the open-source category by a significant margin, with no comparable rivals. The performance gap between Llama-3 and GPT-4 is surprisingly… https://t.co/L2obJmXyw2


