AI research

Aug 23, 2025Aug 23, 2025

Matthias Bastian

Higher token consumption can reduce the efficiency of open reasoning models

Matthias Bastian

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

Open-weight reasoning models often use far more tokens than closed models, making them less efficient per query, according to Nous Research. Models like DeepSeek and Qwen use 1.5 to 4 times more tokens than OpenAI and Grok-4—and up to 10 times more for simple knowledge tasks. Mistral's Magistral models stand out for especially high token use.

Ad

Average tokens used per task by different AI models. | Image: Nous ResearchIn contrast, OpenAI's gpt-oss-120b, with very short reasoning paths, shows that open models can be efficient, especially for math problems. Token usage depends heavily on the type of task. Full details and charts are available at Nous Research.

Ad

Ad

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

High token use can offset low prices in open models. | Image: Nous Research

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

Bank transfer

Sources

Nous Research

Matthias Bastian

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

AI research

Nov 5, 2025Nov 5, 2025

German Commons shows that big AI datasets don’t have to live in copyright limbo

News, tests and reports about VR, AR and MIXED Reality.

What happens next with MIXED My personal farewell to MIXED Meta and Anduril are now jointly developing XR headsets for the US military MIXED-NEWS.com

AI in practice

Oct 30, 2025Oct 30, 2025

OpenAI releases gpt-oss-safeguard open source models for flexible AI safety

AI in practice

Oct 3, 2025

IBM's Granite 4.0 family of hybrid models uses much less memory during inference

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.