Google has launched developer access to Gemini Pro, its new multimodal AI that can process both text and image input.

The model, which runs off-device in Google's data centers, is accessible through the Gemini API and can be integrated using the Google AI SDK, a client SDK for Android. This SDK frees developers from having to build their own backend infrastructure.

Developers can generate API keys, integrate Gemini Pro, and build AI applications through Google AI Studio. Google says it has also made the Gemini API easier to use by adding a new project template to the latest preview of Android Studio.
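
For readers who want a sense of what a call to the Gemini API might look like outside of Android, here is a minimal Python sketch that uses only the standard library. It is not taken from Google's announcement: the endpoint URL, the `key` query parameter, and the JSON request and response shapes are assumptions based on the public REST interface and should be checked against the official documentation. The helper names `build_request` and `generate` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumed REST endpoint for the Gemini Pro model; verify against the official docs.
API_URL = (
    "https://generativelanguage.googleapis.com/v1beta/"
    "models/gemini-pro:generateContent"
)


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a text-only generateContent request."""
    # Assumed request body: a list of contents, each holding a list of parts.
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    query = urllib.parse.urlencode({"key": api_key})
    return urllib.request.Request(
        url=f"{API_URL}?{query}",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, api_key: str) -> str:
    """Send the request and return the text of the first candidate."""
    with urllib.request.urlopen(build_request(prompt, api_key)) as resp:
        payload = json.load(resp)
    # Assumed response shape: candidates -> content -> parts -> text.
    return payload["candidates"][0]["content"]["parts"][0]["text"]
```

Calling `generate("Summarize this article", api_key)` with a key from Google AI Studio would send the prompt to the hosted model. On Android, the Google AI SDK is meant to wrap this kind of request so that developers do not have to write it by hand.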

Last week, Google launched access to Gemini Nano, an on-device model available on select devices through AICore. Nano will run on Google's new Pixel 8.

The MLLM benchmark wars are on

Gemini Pro's performance is about the same as GPT-3.5, but according to Google, it outperforms OpenAI's model in six out of eight benchmarks. Google has also upgraded its ChatGPT competitor Bard with Gemini Pro.

Gemini Ultra, Google's GPT-4 competitor, will follow early next year. Google has shown that Ultra can outperform GPT-4 on important benchmarks such as MMLU when special prompting methods are used. Microsoft, meanwhile, has shown that GPT-4, also driven by special prompts, is just as capable on these benchmarks, if not more so.

Microsoft also recently introduced Phi-2, a small language model that, like Gemini Nano, is optimized for on-device performance. Phi-2 outperforms Google's Gemini Nano in all benchmarks presented by Microsoft.

But those are just benchmarks. The more interesting question is how the models behave in real-world scenarios.

Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Summary
  • Google has launched developer access to Gemini Pro, a multimodal AI that can process both text and image input and can be integrated using the Google AI SDK for Android. Gemini Nano is also available for mobile devices.
  • Gemini Pro's performance is similar to GPT-3.5, and Google claims that it outperforms the OpenAI model in six out of eight benchmarks. Google has also updated its ChatGPT competitor Bard with Gemini Pro.
  • Microsoft introduced Phi-2, a small language model optimized for on-device performance, which outperforms Google's Gemini Nano in all benchmarks presented by Microsoft.
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.