
DeepMind expert says trimming documents improves accuracy despite large context windows

How useful are million-token context windows, really? In a recent interview, Nikolay Savinov from DeepMind explained that when a model is fed many tokens, it has to distribute its attention across all of them: focusing more on one part of the context automatically leaves less attention for the rest. To get the best results, Savinov recommends including only the content that is truly relevant to the task.
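The dilution effect Savinov describes follows directly from how softmax attention normalizes scores: the weights must sum to 1, so padding the context with irrelevant tokens shrinks the weight on the relevant ones. A minimal illustration (the scores here are made up for demonstration):

```python
import math

def softmax(scores):
    # Normalize raw attention scores into weights that sum to 1.
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# One highly relevant token (score 4.0) among irrelevant ones (score 0.0).
short_ctx = softmax([4.0] + [0.0] * 9)    # 10-token context
long_ctx = softmax([4.0] + [0.0] * 999)   # 1000-token context

print(f"relevant-token weight, short context: {short_ctx[0]:.3f}")
print(f"relevant-token weight, long context:  {long_ctx[0]:.3f}")
```

With the same raw score, the relevant token receives roughly 0.86 of the attention in the short context but only about 0.05 in the long one. Real models mitigate this with many heads and layers, but the normalization constraint remains.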

I'm just talking about-- the current reality is like, if you want to make good use of it right now, then, well, let's be realistic.

Nikolay Savinov

Recent research supports this approach. In practice, this could mean cutting out unnecessary pages from a PDF before sending it to an AI model, even if the system can technically process the entire document at once.
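In code, such pre-filtering can be as simple as dropping pages that don't mention the task's key terms before building the prompt. The sketch below uses a crude keyword filter as a stand-in; a production pipeline would more likely use embeddings or a retriever, and the function and sample data are hypothetical:

```python
def trim_context(pages, keywords):
    """Keep only pages that mention at least one task-relevant keyword.

    `pages` is a list of page texts (e.g. extracted from a PDF);
    `keywords` are terms relevant to the task at hand.
    """
    kws = [k.lower() for k in keywords]
    return [p for p in pages if any(k in p.lower() for k in kws)]

pages = [
    "Quarterly revenue rose 12% year over year.",
    "Legal boilerplate and forward-looking statements.",
    "Revenue by segment: cloud grew fastest.",
]

relevant = trim_context(pages, ["revenue"])
# Only the matching pages enter the model's context window.
prompt = "\n\n".join(relevant)
```

Here two of the three pages survive the filter, so the model's attention is spent only on material that can actually answer the question.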
