
Cohere's Command R model gains support from various platforms like Ollama and Sourcegraph, with Ollama requiring the latest version for compatibility. The model's performance, particularly in the Chatbot Arena Leaderboard, has been impressive, outperforming larger models. REWARDBENCH, a new benchmark dataset and codebase, aims to evaluate reward models in AI alignment, addressing the lack of evaluation in this area. Cohere's Command R model has risen to the top 10 in the Arena leaderboard, showcasing its effectiveness in handling longer contexts.
[Arena Update] @cohere's Command R is now top-10 in Arena leaderboard🔥 It's now one of the best open models reaching the level of top proprietary models. We find the model great at handling longer context, which we plan to separate as a new category in Arena very soon.… https://t.co/ezIc5H6frM https://t.co/7AAOLO3emP
Experimental local chat with Cody and @ollama dropped a few weeks ago. Now you can run local inference for Cody Chat and Commands with any model pulled from Ollama. Happy hacking! Learn how 👇 https://t.co/PG4Znwul9D
While reward models (RMs) play a crucial role in AI alignment, their evaluation has been largely overlooked. A recent study introduced REWARDBENCH, a benchmark dataset and codebase designed to assess RMs across various tasks. 1/2 https://t.co/PFjzF6OS0p






