Google DeepMind researchers have developed a new method called SCoRe that helps large language models recognize and fix their own mistakes.
Current large language models (LLMs) struggle with self-correction and often rely on multiple models or external checks. SCoRe, short for "Self-Correction via Reinforcement Learning," instead trains a single model to correct itself using only self-generated data.
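To make the setup concrete, here is a minimal sketch of a two-attempt data collection loop. The `model.generate` helper, the correction prompt, and the `check_answer` verifier are illustrative assumptions, not DeepMind's actual interfaces:

```python
def collect_self_correction_data(model, problems, check_answer):
    """Illustrative two-attempt rollout loop (hypothetical interfaces).

    The same model produces both attempts: a first answer, then a
    revision after being asked to find and fix its own mistakes. Traces
    like these are the only training data a SCoRe-style setup needs.
    """
    traces = []
    for problem in problems:
        first = model.generate(problem)
        prompt = (problem + "\n" + first +
                  "\nThere may be an error in the solution above. Please correct it.")
        second = model.generate(prompt)
        traces.append({
            "problem": problem,
            "attempt_1": first,
            "attempt_2": second,
            "reward_1": check_answer(problem, first),   # e.g., 1.0 if correct
            "reward_2": check_answer(problem, second),
        })
    return traces
```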
SCoRe works in two phases. In the first, it optimizes a model initialization that produces effective corrections on the second attempt while keeping first responses close to those of the base model. A loss function balances both objectives: reward for the improved second attempt and a penalty for drifting from the base model on the first.
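As a rough illustration, a two-part objective of this kind could look like the sketch below. The function names, the REINFORCE-style surrogate, and the `kl_weight` value are assumptions for clarity, not the paper's exact formulation:

```python
import torch
import torch.nn.functional as F

def stage_one_loss(policy_logprobs_2nd, reward_2nd,
                   policy_logits_1st, base_logits_1st,
                   kl_weight=0.25):
    """Sketch of a phase-one style objective (illustrative only).

    - Maximizes the reward-weighted log-likelihood of the second attempt
      (a simple REINFORCE-style surrogate).
    - Penalizes KL divergence between the trained policy and the frozen
      base model on the *first* attempt, keeping initial responses close
      to the base model.
    """
    # Push up log-probs of second attempts in proportion to their reward
    # (e.g., 1.0 if the corrected answer is right, 0.0 otherwise).
    pg_loss = -(reward_2nd * policy_logprobs_2nd).mean()

    # Token-level KL(policy || base) over the first-attempt logits.
    policy_logp = F.log_softmax(policy_logits_1st, dim=-1)
    base_logp = F.log_softmax(base_logits_1st, dim=-1)
    kl = (policy_logp.exp() * (policy_logp - base_logp)).sum(-1).mean()

    return pg_loss + kl_weight * kl
```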
The second phase applies multi-turn reinforcement learning in which the model learns to improve both its first and second answers. A shaped reward function encourages genuine self-correction by granting a bonus for improvement between attempts, so fixing a wrong answer pays more than simply repeating a correct one. Unlike methods that need external verification, SCoRe relies only on self-generated training data: the model creates its own examples by solving problems and then attempting to improve its solutions.
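A hedged sketch of such a shaped reward, assuming a binary correctness score per attempt and a bonus weight `alpha` (both illustrative choices, not values from the paper):

```python
def shaped_reward(correct_1st: float, correct_2nd: float,
                  alpha: float = 0.5) -> float:
    """Illustrative reward shaping for two-attempt training.

    Rewards the second attempt's correctness, plus a bonus proportional
    to the improvement from the first attempt to the second. The bonus
    makes flipping a wrong answer to a right one worth more than simply
    repeating an already-correct answer, discouraging the model from
    collapsing into making no edits at all.
    """
    improvement = correct_2nd - correct_1st  # in {-1, 0, +1} for binary scores
    return correct_2nd + alpha * improvement


# Example: correcting a mistake (0 -> 1) earns 1.5,
# while merely repeating a correct answer (1 -> 1) earns 1.0.
assert shaped_reward(0.0, 1.0) == 1.5
assert shaped_reward(1.0, 1.0) == 1.0
```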
SCoRe achieves significant self-correction gains
Tests with Google's Gemini 1.0 Pro and 1.5 Flash models showed significant gains. On the MATH benchmark for mathematical reasoning, self-correction improved by 15.6 percentage points. For code generation on HumanEval, it rose 9.1 percentage points.
The researchers describe SCoRe as the first approach to achieve meaningfully positive intrinsic self-correction, allowing models to improve their answers without external feedback.
However, SCoRe currently trains for only one round of self-correction. Future work could explore multiple correction steps.
The team concludes that teaching meta-strategies like self-correction requires going beyond standard LLM training, and that multi-stage reinforcement learning may open up new possibilities in this area.