Researchers from Microsoft have introduced Chain-of-Retrieval Augmented Generation (CoRAG), a new approach to Retrieval Augmented Generation (RAG) that retrieves and reasons over relevant information step by step before generating a final answer. Whereas traditional RAG pipelines perform a single retrieval step before responding, CoRAG reformulates its queries dynamically as the reasoning state evolves. The approach yields a more than 10-point improvement on multi-hop question answering tasks and sets a new state of the art on the KILT benchmark for knowledge-intensive tasks. CoRAG is trained with rejection sampling, which generates intermediate retrieval chains to augment existing RAG datasets.
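The mechanics are straightforward to state: at each hop the model proposes a sub-query conditioned on the evidence gathered so far, retrieves against it, and records a sub-answer, stopping once it judges the chain sufficient; rejection sampling then keeps only the chains that end in the correct answer. Below is a minimal Python sketch of that flow, in which `llm`, `retriever`, the prompt strings, and the exact-match filter are illustrative assumptions rather than the paper's actual implementation.

```python
# Minimal sketch of the chain-of-retrieval loop and the rejection-sampling step
# described above. `llm` (with a .generate(prompt) method) and `retriever`
# (with a .search(query, k) method) are hypothetical stand-ins, not the
# paper's components or prompts.

def corag_answer(question, llm, retriever, max_hops=4, top_k=5):
    """Iteratively reformulate queries, retrieve, and reason before answering."""
    chain = []  # evolving state: (sub_query, retrieved_docs, sub_answer) triples
    for _ in range(max_hops):
        sub_query = llm.generate(
            f"Question: {question}\nChain so far: {chain}\n"
            "Write the next retrieval query, or DONE if the evidence is sufficient."
        )
        if sub_query.strip() == "DONE":
            break
        docs = retriever.search(sub_query, k=top_k)   # one retrieval hop
        sub_answer = llm.generate(
            f"Sub-query: {sub_query}\nDocuments: {docs}\nAnswer the sub-query concisely."
        )
        chain.append((sub_query, docs, sub_answer))
    # The final answer conditions on the whole chain, not a single retrieval.
    final = llm.generate(
        f"Question: {question}\nEvidence chain: {chain}\nFinal answer:"
    )
    return final, chain


def augment_with_chains(example, llm, retriever, n_samples=8):
    """Rejection sampling: keep chains whose final answer matches the gold label."""
    kept = []
    for _ in range(n_samples):
        answer, chain = corag_answer(example["question"], llm, retriever)
        if answer.strip().lower() == example["answer"].strip().lower():
            kept.append(chain)  # these chains augment the original QA-only dataset
    return kept
```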
🚀 Introducing RAGEN—the world’s first reproduction of DeepSeek-R1(-Zero) methods for training agentic AI models! We’re betting big on the future of RL + LLM + Agents 🤖✨. This release is a minimally viable leap toward that vision. Code and more intro 🔗:… https://t.co/AG6lUYjA23
why did R1's RL suddenly start working, when previous attempts to do similar things failed? theory: we've basically spent the last few years running a massive acausally distributed chain of thought data annotation program on the pretraining dataset. deepseek's approach with R1…
What we 🏗️ built, 🚢 shipped, and 🚀 shared last week: Agent Evaluation with @ragas_io. We learned: 🪡 It’s all about LLM traces; test-set generation is not ready yet. 📊 Agent metrics must be combined with LLM and RAG metrics! 🎥 Recording: https://t.co/RHudkxJLto…
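One way to act on the "agent metrics must be combined with LLM and RAG metrics" takeaway is to blend a trace-derived agent metric with a RAG-level score such as faithfulness. The sketch below is a hypothetical illustration: the trace schema, the `tool_call_success_rate` helper, and the externally supplied `faithfulness_score` (e.g., produced by a library like Ragas) are assumptions, not the Ragas API.

```python
from statistics import mean

def tool_call_success_rate(trace):
    """Agent-level metric: fraction of tool calls in an LLM trace that succeeded."""
    calls = [step for step in trace["steps"] if step["type"] == "tool_call"]
    return mean(step["ok"] for step in calls) if calls else 1.0

def combined_score(trace, faithfulness_score, w_agent=0.5, w_rag=0.5):
    """Blend an agent metric with a RAG metric into one evaluation score."""
    return w_agent * tool_call_success_rate(trace) + w_rag * faithfulness_score

# Example trace with an assumed schema: each step records its type and outcome.
trace = {
    "steps": [
        {"type": "tool_call", "ok": True},
        {"type": "tool_call", "ok": False},
        {"type": "llm_generation", "ok": True},
    ]
}
print(combined_score(trace, faithfulness_score=0.9))  # 0.5*0.5 + 0.5*0.9 = 0.7
```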