AI in practice

May 10, 2025

Gemini Flash 2.5 becomes 150 times more expensive for reasoning tasks than Flash 2.0

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.


Reasoning sharply raises the cost of running AI models, according to a new analysis by Artificial Analysis. With reasoning enabled, Google's Gemini Flash 2.5 costs about 150 times more to run than Flash 2.0. Two factors compound: Flash 2.5 uses 17 times more tokens, and Google charges $3.50 per million output tokens with reasoning, compared to $0.40 for the earlier model. This makes Flash 2.5 the most expensive model in terms of token use for reasoning. OpenAI's o4-mini costs more per token but uses fewer tokens overall, making it cheaper to run in the benchmark.
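The 150x figure can be roughly reproduced from the numbers quoted above. The following is a minimal sketch, not Artificial Analysis' actual methodology: it assumes the whole bill scales with output-token price and ignores input tokens, and the function and variable names are made up for illustration.

```python
# Figures quoted in the article (source: Artificial Analysis).
TOKEN_MULTIPLIER = 17             # Flash 2.5 with reasoning uses 17x the tokens
PRICE_FLASH_25_REASONING = 3.50   # USD per million output tokens, reasoning on
PRICE_FLASH_20 = 0.40             # USD per million output tokens


def cost_multiplier(token_multiplier: float, new_price: float, old_price: float) -> float:
    """Hypothetical helper: total cost ratio = token ratio * price ratio.

    Assumption: every token is billed at the output price, so input
    tokens and any other pricing details are ignored.
    """
    return token_multiplier * (new_price / old_price)


price_ratio = PRICE_FLASH_25_REASONING / PRICE_FLASH_20
estimate = cost_multiplier(TOKEN_MULTIPLIER, PRICE_FLASH_25_REASONING, PRICE_FLASH_20)

# Cross-check against the rounded benchmark totals shown in the chart
# ($445 for Flash 2.5 with reasoning, $3 for Flash 2.0).
chart_ratio = 445 / 3

print(f"price ratio:      {price_ratio:.2f}x")
print(f"estimated total:  {estimate:.2f}x")
print(f"chart totals:     {chart_ratio:.1f}x")
```

Both routes land just under 150, which is consistent with the rounded "150x" label. The chart totals are themselves rounded to whole dollars, so the cross-check is only approximate.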

Bar chart titled “Cost to Run Artificial Analysis Intelligence Index.” It shows total U.S. dollar costs to complete all tests in the Artificial Analysis Intelligence Index using different AI models. Bars are split into three colors: Input (blue), Reasoning (purple), Output (green). On the left are the most expensive models: GPT-3 ($1,951), Claude 3 Opus ($1,485), Gemini 2.5 Pro ($844). In the middle: Gemini 2.5 Flash with reasoning ($445), o4-mini (high) ($323). On the right are the cheapest models: Gemini 2.0 Flash ($3), Llama 3 8B ($2). A purple arrow above highlights the cost gap between Gemini 2.0 Flash and Gemini 2.5 Flash with reasoning, labeled “150x.” Source: Artificial Analysis.

Google's Gemini Flash 2.5 costs 150 times more to run with reasoning enabled than Flash 2.0, due to higher token use and pricing. | Image: Artificial Analysis


Sources

Artificial Analysis via X

Matthias Bastian

