Nous Research has launched Atropos, a new reinforcement learning (RL) environments framework designed to improve the training of large language models (LLMs). Atropos supports scalable, distributed RL pipelines in which models refine their reasoning and alignment through trial-and-error interaction with environments, the feedback loop that distinguishes RL from conventional fine-tuning. Nous reports notable performance gains, including a fivefold improvement on the Berkeley Function Calling benchmark with a specialized tool-calling model. The release marks a key development in advancing RL techniques for AI, alongside related machine learning research from academic institutions including Fudan University, Northwestern University, UC Berkeley, Carnegie Mellon University, Princeton University, New York University, Stanford University, and The University of Hong Kong.
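The trial-and-error loop such a framework coordinates can be sketched as follows. This is a hypothetical toy illustration, not Atropos's actual API: `ToolCallEnv`, `policy`, and `collect_rollouts` are invented names, and the rule-based `policy` stands in for an LLM sampling a completion.

```python
import random

class ToolCallEnv:
    """Toy environment: reward 1.0 iff the policy emits the expected tool call."""
    def __init__(self):
        self.tasks = [("get_weather", "What is the weather in Paris?"),
                      ("get_time", "What time is it in Tokyo?")]

    def reset(self) -> str:
        # Pick a task and return its prompt; remember the expected tool name.
        self.expected, prompt = random.choice(self.tasks)
        return prompt

    def step(self, action: str) -> float:
        # Score the attempted tool call against the expected one.
        return 1.0 if action == self.expected else 0.0

def policy(prompt: str) -> str:
    # Stand-in for an LLM: a real trainer would sample a completion here.
    return "get_weather" if "weather" in prompt else "get_time"

def collect_rollouts(env: ToolCallEnv, n: int):
    """Gather (prompt, action, reward) triples for a downstream policy update."""
    rollouts = []
    for _ in range(n):
        prompt = env.reset()
        action = policy(prompt)
        rollouts.append((prompt, action, env.step(action)))
    return rollouts

rollouts = collect_rollouts(ToolCallEnv(), 4)
print(sum(r for _, _, r in rollouts) / len(rollouts))  # mean reward
```

In a distributed setup, many such environments would run rollouts in parallel and stream scored trajectories back to a central trainer, which is the kind of coordination an RL environments framework is built to handle.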
[CL] SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning J Chen, B Zhang, R Ma, P Wang... [The University of Hong Kong & Tencent] (2025) https://t.co/TqeQ9c9OEg https://t.co/ixjiAmPsa8
[LG] Accelerating Mixture-of-Experts Training with Adaptive Expert Replication A Skiadopoulos, M Zhao, S Gandhi, T Norrie... [Stanford University] (2025) https://t.co/pYngupwZRK https://t.co/9FTfABmzUT
[LG] Emergence and scaling laws in SGD learning of shallow neural networks Y Ren, E Nichani, D Wu, J D Lee [Princeton University & New York University] (2025) https://t.co/ZdpGYJtQBd https://t.co/V78rZW0dzb