Ad
Skip to content

YouTuber builds Google's staged Gemini demo in real-time with GPT-4 Vision

YouTuber "Greg Technology" has recreated Google's discredited multimodal Gemini AI demo using OpenAI's GPT-4 Vision to demonstrate real-time voice and vision prompts. The original Gemini AI demo video, which was criticized for being staged and not recorded in real-time, featured voice interactions that were later dubbed in. In response, Greg Technology released a video using GPT-4V in which he discussed a drawing, asked about emoticons, and had the AI identify a game. It's not as polished as Google's demo, of course, but it's real-time and real. Greg has published his demo code on GitHub.

Ad
DEC_D_Incontent-1

AI News Without the Hype – Curated by Humans

As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.

Source: YouTube