According to internal documents obtained by TechCrunch, Google has been benchmarking its Gemini AI model against Anthropic's Claude. Google contractors are given up to 30 minutes per prompt to evaluate which model produces the better output, focusing on criteria such as truthfulness and comprehensiveness. Claude's answers tend to be more safety-conscious than Gemini's, TechCrunch reports.

A Google DeepMind spokesperson confirmed that the company compares results across models, but stressed that it does not use Anthropic's models to directly improve Gemini, which would violate Anthropic's terms of service. This kind of competitive benchmarking is common in the AI industry, where companies regularly evaluate their models against rivals to see where they stand. Notably, Google is also an investor in Anthropic.