Recent work has produced state-of-the-art quantized reasoning models built on the DeepSeek-R1-Distill suite. An experimental 3.8-billion-parameter model has demonstrated reasoning performance comparable to, or exceeding, that of larger models such as DeepSeek-R1-Distill-Qwen-7B and DeepSeek-R1-Distill-Llama-8B. The quantized models use FP8 and INT8 quantization and achieve near-perfect accuracy recovery across a range of reasoning benchmarks. Separately, the Tiny-R1-32B-Preview model has been released: it outperforms DeepSeek-R1-Distill-70B and nearly matches the full R1 model on mathematical reasoning. Tiny-R1 was developed by researchers from Peking University and Qihoo 360, and the team plans to release its training and evaluation code soon. The common thread across these releases is making reasoning models more efficient, in particular scaling inference compute while minimizing performance loss.
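To give a rough sense of why INT8 weight quantization can recover accuracy almost exactly, here is a minimal NumPy sketch of symmetric per-channel quantization and dequantization with a round-trip error check. This is an illustration of the general idea only, not the recipe used for the released quantized DeepSeek-R1-Distill checkpoints; the matrix size and error threshold are arbitrary.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric per-output-channel INT8 quantization of a weight matrix."""
    # One scale per output channel (row), chosen so the max |w| maps to 127.
    scales = np.abs(weights).max(axis=1, keepdims=True) / 127.0
    q = np.clip(np.round(weights / scales), -127, 127).astype(np.int8)
    return q, scales

def dequantize_int8(q: np.ndarray, scales: np.ndarray) -> np.ndarray:
    """Recover an FP32 approximation of the original weights."""
    return q.astype(np.float32) * scales

# Round-trip a random weight matrix and measure the relative error.
w = np.random.randn(4096, 4096).astype(np.float32)
q, s = quantize_int8(w)
w_hat = dequantize_int8(q, s)
rel_err = np.linalg.norm(w - w_hat) / np.linalg.norm(w)
print(f"relative round-trip error: {rel_err:.4f}")  # roughly 1% or less for Gaussian weights
```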
Problem: Scaling test-time compute for LLM reasoning is limited by Transformer inference inefficiency. Solution: Distilling Transformers into Mamba models enables faster inference, letting them surpass Transformer reasoning under fixed compute budgets. 📌 Distillation effectively transfers Transformer reasoning to faster… https://t.co/Pd7eoRxdiR
[CL] Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners D Paliotta, J Wang, M Pagliardini, K Y. Li... [University of Geneva & Together AI & EPFL] (2025) https://t.co/M6E9iOWDvn https://t.co/XmAyYMTiXJ
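The fixed-budget argument above comes down to this: a cheaper model can afford more samples per question for the same compute, and aggregating those samples (e.g. by majority voting) improves accuracy. Below is a generic sketch of majority voting over N sampled answers; the `sample_answer` callable is a placeholder, not the paper's actual evaluation code.

```python
import random
from collections import Counter
from typing import Callable

def majority_vote(question: str,
                  sample_answer: Callable[[str], str],
                  n_samples: int) -> str:
    """Sample n_samples candidate answers and return the most common one.

    A faster model (e.g. a distilled Mamba reasoner) can afford a larger
    n_samples under the same wall-clock or FLOP budget, which is how cheaper
    inference translates into better accuracy at fixed compute.
    """
    answers = [sample_answer(question) for _ in range(n_samples)]
    return Counter(answers).most_common(1)[0][0]

# Stubbed sampler for demonstration; in practice sample_answer would call the
# model with temperature > 0 and extract the final answer from its reasoning chain.
dummy_sampler = lambda q: random.choice(["42", "42", "41"])
print(majority_vote("What is 6 * 7?", dummy_sampler, n_samples=16))
```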
Tiny-R1-32B-Preview 🔥 reasoning model that outperforms Deepseek-R1-Distill-70B and nearly matches the full R1 in math, released by @PKU1898 & @QIHU_Official https://t.co/M40UjC5RPt ✨ Built with SFT + R1-generated responses ✨ Will release training and evaluation code, selected… https://t.co/JO6Ci99CIe
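The "SFT + R1-generated responses" recipe mentioned in the Tiny-R1 announcement is ordinary supervised fine-tuning on distilled reasoning traces. Here is a minimal sketch using Hugging Face TRL, written before the official training code is released: the dataset file, hyperparameters, and base model identifier are all placeholders, not details confirmed by the Tiny-R1 team.

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Hypothetical JSONL file of R1-generated reasoning traces with a "text" column;
# the actual Tiny-R1 data mixture has not been released.
dataset = load_dataset("json", data_files="r1_generated_traces.jsonl", split="train")

config = SFTConfig(
    output_dir="tiny-r1-sft",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=16,
    num_train_epochs=1,
    learning_rate=1e-5,
)

trainer = SFTTrainer(
    model="deepseek-ai/DeepSeek-R1-Distill-Qwen-32B",  # assumed base model, for illustration only
    train_dataset=dataset,
    args=config,
)
trainer.train()
```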