Content
summary Summary

Google is rolling out an updated image model called Nano Banana Pro, also known as Gemini 3 Pro Image.

Ad

It replaces the Gemini 2.5 Flash Image model from August and is built to handle complex scenes with consistent physics, render text accurately, and use real-time information as input. It also appears to be one of the first models that can generate infographic drafts that actually make sense.

The model can take up to 14 inputs at once, including reference images, sketches, or logos. It can keep up to five characters consistent and generate images at up to 4K resolution. Users can also adjust specific areas of an image, such as brightness, focus, or color.

The model is available as a paid preview through the Gemini API, Google AI Studio, and Vertex AI. In the Gemini app, consumer access is limited, with a small free quota and more usage for Pro and Ultra subscribers.

Ad
Ad

Google's new image model checks physics before it renders

Unlike standard diffusion models, Gemini 3 Pro performs a reasoning step before it renders anything. According to Google's technical description, the model reviews the inputs and checks physical and logical details such as lighting, shadow gradients, camera angle, and depth of field. Google says this leads to more realistic results, especially for architecture, product mockups, or scenes with multiple light sources.

A realistic depiction of a grey alien riding a horse, which in turn is riding on the shoulders of an astronaut. The alien, horse, and astronaut have correct and realistic proportions relative to each other. The scene is set in an oriental art style, with a moon on the left side of the composition and a sun on the right. The lighting reflects the duality of moonlight and sunlight. The overall aspect ratio is 16:9. | Image: Nano Banana Pro prompted by THE DECODER
"The alien should be smaller." | Image: Nano Banana Pro prompted by THE DECODER

With its "Grounding with Google Search" feature, the model can also reference live information. It can turn real-time data into images, creating things like current weather maps, infographics, or historically accurate scenes. It supports multimodal prompts and outputs that combine text and images. Token limits are 64,000 for inputs and 32,000 for outputs.

Gemini 3 Pro also brings major upgrades to text rendering. It’s designed to generate longer text passages that stay readable and accurate across languages, while preserving the visual style of posters, packaging, or similar layouts. Translations are context-aware and keep typography consistent.

The model supports multi-step editing, allowing users to refine images over several rounds. Google highlights examples like localized ad layouts and infographics, including a recipe for Elaichi Chai and my "Grey Alien" poster.

Prompt: "Generate a 16:9 infographic, humorous, about a 'Grey Alien'." | Image: Nano Banana Pro prompted by THE DECODER
German translation of the same infographic. | Image: Nano Banana Pro prompted by THE DECODER

Google has also integrated Gemini 3 Pro Image into its new developer platform Antigravity, allowing coding agents to generate UI mockups or visual assets. Image generation in Google Ads is rolling out globally in eight languages. Users with a Google AI Ultra subscription can also access the model in the Flow film production tool.

Recommendation
Bar chart showing Gemini 3 Pro Image scoring highest in Overall Preference, Visual Quality, and Infographics compared with Gemini 2.5 Flash Image, GPT-Image 1, Seeddream v4 4k, Flux Pro Kontext Max, and Hunyuan v3. | Image: Google

All images generated with Gemini 3 Pro include invisible SynthID tagging. Free and Pro users also receive a visible watermark, while only Ultra subscribers can generate watermark-free images. Google also offers an upload tool in Gemini that lets users verify an image's provenance.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Google has rolled out a new image model called "Nano Banana Pro" (Gemini 3 Pro Image). It's built to handle complex scenes more logically and place text correctly inside images.
  • The model adds reasoning tools that let it work with up to 14 inputs when creating an image and simulate basic physical relationships for more realistic results. It can also turn real-time information into images, including things like weather maps or infographics.
  • The model is available to businesses as a paid preview through the Gemini API. Regular users can try it in the Gemini app with a limited number of uses before it switches back to the previous model.
Sources
Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.