Sources
🍓🍓🍓i’ve wanted a top tier benchmark since lmsys died slowly and painfully is this it: https://t.co/1ahlo49QHs credit: @AIExplainedYT
🍓🍓🍓id seen this somewhere but hadn’t realised it was @AIExplainedYT this will likely become the best llm benchmark in all of texas. nice kol. https://t.co/OkVMNHwv5e
Kol TregaskesPhilip (@AIExplainedYT) got fed up with all these poor-quality benchmarks and made one himself If you watch even a handful of his videos you'll know AI Explained is not impressed with the popular LLM benchmarks, particularly MMLU and HellSwag. So Philip has produced his own… https://t.co/0jsY6RXjoC


