Mar 20, 04:33 PM

Starling AI Introduces Starling-LM-7B-beta and Starling-RM-34B, Surpassing Previous Models in RewardBench Evaluation

Starling AI introduces cutting-edge language models Starling-LM-7B-beta and Starling-RM-34B, surpassing previous models in benchmarks. The community lauds the new RewardBench benchmark evaluating 30+ reward models, with Starling-34B-RM leading the leaderboard.

#Starling #RewardBench

Written with ChatGPT (GPT-3).

Sources

Jian Zhang@JianZhangCS
2 years ago
Great work from @BanghuaZ and team Top ranking reward model on reward bench from @allen_ai and new starling beta for chat https://t.co/otnorkxpvR
Nouha Dziri@nouhadziri
2 years ago
Reward models are the essence of success in RLHF, yet there has been little focus on evaluating them 😬 We introduce RewardBench💥 the first benchmark for reward models. We evaluated 30+ of the existing RMs (w/ DPO) and created new datasets. Discover lots of insightful analyses👇 https://t.co/q9XvVpPDwD
Jian Zhang@JianZhangCS
2 years ago
📢Exciting release of Starling-7B-beta chat model and Starling-34B-RM reward model powered by Nexusflow latest technology. I am continuously amazed by how fast and powerful the small striking team behind Starling is! https://t.co/luF4RUVqBe

Starling AI Introduces Starling-LM-7B-beta and Starling-RM-34B, Surpassing Previous Models in RewardBench Evaluation

Sources

Additional media

Similar Stories