A new leaderboard for large language models (LLMs) has been released, showcasing models and their performance metrics across several size categories. Notably, the overall rank-1 model sits in the 70+B category, with an average score of 52.02 and an IFEval score of 80.63. Other high-ranking entries include a 35B model in 7th place with an average score of 36.2, and a 13B model ranked 1st in its category with an average of 39.43. The leaderboard also lists models in the 1.5B and 7B categories, with overall ranks ranging from 271 to 642. Each entry reports performance on benchmarks such as IFEval, BBH, and MMLU-PRO, demonstrating capabilities across a range of tasks. Overall, the leaderboard reflects the ongoing advances in LLMs and their competitive landscape.
LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs https://t.co/9iXSI4BQFJ
New model added to the leaderboard! Model Name https://t.co/OBY9cHHtPL
Overall rank: 380 | Rank in 7B category: 101
Benchmarks: Average: 28.13 | IFEval: 42.1 | BBH: 36.86 | MATH Lvl 5: 24.62 | GPQA: 9.4 | MUSR: 18.44 | MMLU-PRO: 37.37
New model added to the leaderboard! Model Name https://t.co/4mmyWZGTx3
Overall rank: 344 | Rank in 7B category: 77
Benchmarks: Average: 28.79 | IFEval: 60.36 | BBH: 33.99 | MATH Lvl 5: 23.56 | GPQA: 5.82 | MUSR: 12.14 | MMLU-PRO: 36.84
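For both posts above, the reported "Benchmarks Average" appears to be the plain arithmetic mean of the six per-benchmark scores. A minimal Python sketch checks this against the posted numbers; the helper name and score dictionaries are illustrative, not taken from the leaderboard's own code.

```python
# Sketch: verify that "Benchmarks Average" is the arithmetic mean of the
# six per-benchmark scores. Names here are illustrative assumptions.

def benchmark_average(scores: dict) -> float:
    """Arithmetic mean of the per-benchmark scores."""
    return sum(scores.values()) / len(scores)

# Scores copied from the two "new model" posts above.
model_a = {"IFEval": 42.1, "BBH": 36.86, "MATH Lvl 5": 24.62,
           "GPQA": 9.4, "MUSR": 18.44, "MMLU-PRO": 37.37}
model_b = {"IFEval": 60.36, "BBH": 33.99, "MATH Lvl 5": 23.56,
           "GPQA": 5.82, "MUSR": 12.14, "MMLU-PRO": 36.84}

# Both means land within rounding distance of the posted averages
# (28.13 and 28.79 respectively).
print(benchmark_average(model_a))
print(benchmark_average(model_b))
```

The means come out to roughly 28.13 and 28.79, matching the posted averages to two decimal places, which supports reading the leaderboard's average as an unweighted mean over the six benchmarks.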