Google's new Gemini 1.5 AI models offer more power and speed at lower costs
Key Points
- Google has released two improved versions of its Gemini AI models: Gemini 1.5 Pro 002 and Gemini 1.5 Flash 002. The new models are said to be more powerful, faster, and cheaper than their predecessors.
- The prices for Gemini 1.5 Pro have been reduced by more than 50 percent for input and output tokens. Additionally, the rate limits for both models have been increased and latency reduced. The models have improved in various benchmarks, particularly in the areas of math, long context, and vision.
- The Gemini models are available via Google AI Studio, the Gemini API, and, for Google Cloud customers, on Vertex AI. For Gemini Advanced users, Google will soon release a chat-optimized version of Gemini 1.5 Pro-002.
Google has released two updated Gemini AI models that promise more power, speed, and lower costs.
The new versions, Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002, offer significant improvements over their predecessors, according to Google, showing gains across a range of benchmarks, particularly in maths, long context and visual tasks.
In addition, the company has reduced the price of input and output tokens for Gemini 1.5 Pro by more than 50%, increased rate limits for both models and reduced latency.

New Gemini models perform better at math benchmarks
On the MMLU-Pro benchmark, a more challenging version of MMLU, the models improved by about 7%. Math performance saw a notable 20% boost on the MATH and HiddenMath benchmarks. Vision and code-related tasks also saw improvements, with 2-7% gains in visual understanding and Python code generation evaluations.

Google claims the models now provide more helpful responses while maintaining content safety standards. The company refined the models' output style based on developer feedback, aiming for more precise and cost-effective use.
Google has also released an improved version of the Gemini 1.5 experimental model announced in August. The updated version, 'Gemini-1.5-Flash-8B-Exp-0924', offers further enhancements for text and multimodal applications.
Users can access the new Gemini models through Google AI Studio, the Gemini API, and Vertex AI for Google Cloud customers. A chat-optimized version of Gemini 1.5 Pro-002 is coming soon for Gemini Advanced users.
The new pricing takes effect on October 1, 2024, for prompts under 128,000 tokens. Combined with context caching, Google expects development costs with Gemini to decrease further.
AI News Without the Hype – Curated by Humans
As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.
Subscribe now