Content
summary Summary

Google has opened broader access to Gemini 2.5 Pro, its latest AI flagship model, which demonstrates impressive performance in scientific testing while introducing competitive pricing.

Ad

According to Alphabet CEO Sundar Pichai, Gemini 2.5 Pro represents Google's "most intelligent model + now our most in demand." Demand has increased by over 80 percent this month alone across both Google AI Studio and the Gemini API. Starting this week, users can access an expanded public preview with higher usage limits, including a free tier option.

Gemini Web Chat users can continue accessing the 2.5 Pro Experimental model, which should deliver equivalent performance. Google plans additional announcements at its Cloud Next '25 conference on April 9.

Competitive pricing

The Gemini 2.5 Pro API follows a tiered pricing model. For prompts up to 200,000 tokens, input costs $1.25 per million tokens, with output at $10. Larger prompts increase to $2.50 and $15 per million tokens respectively. While prompt caching isn't currently available, even in the paid tier, its future implementation could further reduce costs.

Ad
Ad

Google offers free grounding with Google search for up to 500 queries daily, followed by 1,500 additional free queries. Beyond that, each 1,000 queries costs $35. According to the terms of use, free tier data may be used for AI training, while paid tier data cannot.

Table shows pricing for input and output tokens, as well as conditions for the Grounding API and data usage for Gemini 2.5 Pro.
Overview of prices and conditions of the Gemini 2.5 Pro API in the free and paid tiers. | Image: Screenshot via Google

Compared to competing models such as Claude 3.7 Sonnet, Gemini 2.5 Pro is significantly cheaper with the same or better performance. The price-performance battle in the model market therefore continues.

Strong performance in scientific testing

The AI research group EpochAI reports that Gemini 2.5 Pro scored 84% on the GPQA Diamond benchmark - notably higher than human experts' typical 70% score. The benchmark features particularly challenging multiple-choice questions across biology, chemistry, and physics. EpochAI's independent test validates Google's benchmark results.

Bar chart shows Gemini 2.5 Pro with 84% accuracy in GPQA benchmark, well ahead of Claude 3.7 Sonnet and other models.
Gemini 2.5 Pro achieves the highest accuracy of all models tested in the GPQA Diamond benchmark. | Image: EpochAI

While Google hasn't released technical specifics about the model's architecture, training data, or computational requirements, it's known to be a "reasoning" model similar to OpenAI's o-series. EpochAI notes their testing has been limited by the experimental model's current rate restrictions.

The model's capabilities extend beyond GPQA. On the challenging "Humanity's Last Exam," Gemini 2.5 Pro achieved 18.8% - the highest score among models without additional tools, significantly outperforming competitors like Deepseek-R1's nine percent.

Recommendation

In weekly testing on trackingAI.org, the experimental version demonstrated impressive cognitive abilities, scoring an average IQ of 130 - well above the typical 90-110 range seen in other language models.

A graph showing IQ test results from TrackingAI.org comparing various AI models. The x-axis shows IQ scores from 50 to 160, with a bell curve distribution peaking around 100. Various AI models are plotted along the curve with their logos and names, including GPT-4, Llama, Gemini, Claude, and others. Gemini 2.5 Pro Experimental appears near the higher end of the scale. The graph includes a legend at the bottom identifying all models tested. The interface shows options to 'Reset', 'Show Offline Test', and 'Show Mensa Norway' at the top.
Gemini 2.5 Pro Experimental achieves the highest score (116) of all language models in the weekly IQ test.

These IQ assessments use text versions of the Norwegian Mensa IQ test, presenting questions verbally rather than visually like traditional vision models. The questions aren't included in training data, and if a model hesitates to answer, it gets up to ten attempts before its last valid response is recorded.

Google's new model has also received consistently positive feedback on X. Computer scientist François Chollet describes Gemini 2.5 Pro as his daily working model. For him, it is the best model for almost all tasks - with the exception of image generation, where it also performs well.

According to investor Martin Casado, he uses it almost exclusively for coding tasks. In his comparison table, Peter Yang rated Gemini 2.5 as currently the best model for programming tasks. Japanese AI researcher Shane Gu praises the model's cost-benefit ratio in particular: Gemini is on the Pareto frontier in all price categories.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Google has expanded access to its Gemini 2.5 Pro AI model through a Free Tier option and tiered API pricing, offering lower costs compared to similar models in the market.
  • The model achieved 84% accuracy on the GPQA Diamond scientific benchmark test, performing better than human experts, with results verified by EpochAI.
  • The AI community, particularly developers on X, commends Gemini 2.5 Pro's effectiveness and cost efficiency, especially for programming and coding applications.
Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.