A series of new models have been added to the leaderboard, showcasing a range of performance metrics across various categories. Notably, one model achieved an overall rank of 48, ranking first in the 13B category with an average score of 40.69, an IFEval score of 73.05, and a BBH score of 49.51. Another model ranked 44 in the 13B category with an average score of 41.22 and an IFEval score of 72.11. Other models included those ranked 127 with an average of 37.48 and an IFEval of 53.61, and 606 with an average of 28.96 and an IFEval of 73.71. Additionally, Aya Expanse has been recognized as the best open-weights model on @scale's private multilingual protocol, outperforming both proprietary and larger models in certain languages, indicating strong multilingual capabilities.
New model added to the leaderboard! Model Name https://t.co/nwGhd9bkfC Overall rank: 1743 Rank in 35B category: 126 Benchmarks Average: 20.12 IFEval: 41.86 BBH: 17.15 MATH Lvl 5: 0.0 GPQA: 4.59 MUSR: 16.14 MMLU-PRO: 40.96
New model added to the leaderboard! Model Name https://t.co/LsTIF4gVfW Overall rank: 508 Rank in 7B category: 60 Benchmarks Average: 30.09 IFEval: 78.41 BBH: 35.15 MATH Lvl 5: 16.84 GPQA: 6.82 MUSR: 10.35 MMLU-PRO: 32.98
New model added to the leaderboard! Model Name https://t.co/K9ELce9DmD Overall rank: 955 Rank in 7B category: 325 Benchmarks Average: 25.01 IFEval: 51.19 BBH: 32.68 MATH Lvl 5: 17.82 GPQA: 7.38 MUSR: 11.62 MMLU-PRO: 29.36