Jun 4, 02:07 PM

Microsoft's Phi-3 Models Added to LMSYS Chatbot Arena Leaderboard, Llama-3 Leads Open-Source LLMs

Microsoft's Phi-3 Medium (14B) and Phi-3 Small (7B) models have been added to the LMSYS Chatbot Arena Leaderboard. Phi-3 Medium ranks near GPT-3.5-Turbo-0613 but trails Llama 3 8B, while Phi-3 Small sits close to Llama-2-70B and the Mistral fine-tunes. Notably, the Phi-3 family had previously outscored Llama 3 8B on the MMLU benchmark with a model of just 3.8B parameters. Meanwhile, Llama 3 leads the open-source large language model (LLM) category on the Chatbot Arena Leaderboard by a significant margin. Separately, the MMLU-Pro benchmark has been introduced to measure expert-level intelligence; GPT-4o, Gemini-1.5-Pro, and Claude-3-Opus currently hold its top three positions.

#Small #Microsoft #LMSYS Chatbot Arena Leaderboard #Medium #Llama #Mistral #Chatbot Arena Leaderboard #GPT

Written with ChatGPT (GPT-4o).