NexusflowX's Starling-7B Ranks 13th, Beats Competitors

OMG! This is insane!! A 7B model is now beating GPT 3.5 in the LMSYS Chatbot Arena, a.k.a. the ONLY BENCHMARK that matters because it is based on blind human eval and can't be gamed. Starling-7B scores on top of GPT 3.5, Mistral, and Gemini Pro!! 🤯🤯 Link - https://t.co/RaiSvtd8Jc https://t.co/HVcukGGmrp

Lewis Tunstall@_lewtun

2 years ago

This new Starling model shows that we haven't quite hit the ceiling on 7B fine-tunes. Unlike other public benchmarks, the Chatbot Arena is relatively hard to game. It would be very interesting to see the @NexusflowX recipe applied to #DBRX! https://t.co/Bvy3eUCxnv

Banghua Zhu@BanghuaZ

2 years ago

DBRX is an amazing masterpiece! If you're looking for smaller models for your use cases, plz give Starling-7B a try, which seems not too bad according to chatbot arena! https://t.co/m2EPKKGzxO
