Ad
Skip to content

Google plans "next generation series of models" for 2024

Image description
DALL-E 3 prompted by THE DECODER

According to Alphabet's CEO, Google's Gemini is just the first of a series of next-generation AI models that Google plans to bring to market in 2024.

With the multimodal Gemini AI model, Google wants to at least catch up with OpenAI's GPT-4. The model is expected to be released later this year. In the recent quarterly earnings call, Alphabet CEO Sundar Pichai said that Google is "getting the model ready".

Gemini will be released in different sizes and with different capabilities, and will be used for all internal products immediately, Pichai said. So it is likely that Gemini will replace Google's current PaLM-2 language model. Developers and cloud customers will get access through Vertex AI.

Most importantly, Google is "laying the foundation of what I think of as the next-generation series of models we'll be launching throughout 2024," Pichai said.

Ad
DEC_D_Incontent-1

"The pace of innovation is extraordinarily impressive to see. We are creating it from the ground-up to be multimodal, highly efficient tool and API integrations and more importantly, laying the platform to enable future innovations as well," Pichai said.

Create a prototype app with an image-text prompt: Gemini powers no-code app development in Google's "Stubbs"

Developer Bedros Pamboukian reports on a new AI tool in the works at Google called Stubbs, which will likely be powered by Gemini. Pamboukian believes that Stubbs could be Google's most important release.

Stubbs is designed to make it easier to prototype apps or AI models by generating prototypes from a text description or an image (or both combined), which can then be published and shared much like Figma prototypes. Pamboukian has not yet been able to determine the exact functionality based on code snippets, but he shows first screenshots of the presumed user interface.

The interface of "Stubbs": Here you can generate an app prototype with a combined text-image prompt. | Image: Google via Bedros Pamboukian

During his research, Pamboukian also came across an AI model called "Multimodal IT M," which could be a variant of Gemini. In addition to text, the model can also process images and write subtitles for images, for example. These functions are also offered by Google Bard or GPT-4V.

Ad
DEC_D_Incontent-2

AI News Without the Hype – Curated by Humans

As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.

AI news without the hype
Curated by humans.

  • Over 20 percent launch discount.
  • Read without distractions – no Google ads.
  • Access to comments and community discussions.
  • Weekly AI newsletter.
  • 6 times a year: “AI Radar” – deep dives on key AI topics.
  • Up to 25 % off on KI Pro online events.
  • Access to our full ten-year archive.
  • Get the latest AI news from The Decoder.
Subscribe to The Decoder