OpenAI has released a demo of an AI-powered tool for automated front-end testing on GitHub. The tool uses OpenAI's Computer-Using Agent (CUA) together with the open-source Playwright framework to generate, run, and evaluate test cases from written descriptions. OpenAI says the goal is to make software testing more efficient and reliable. The project is currently an early-stage concept study.
ChatGPT lost badly to Atari's 1979 Video Chess engine in an experiment run by Robert Caruso. The chatbot gave solid advice and explained tactics, but it forgot captured pieces, mistook rooks for bishops, and lost track of the board turn after turn. Atari's 1.19 MHz engine had no such issues: it simply remembered the state and followed the rules.
Some critics say Caruso's experiment compares apples and oranges, but it underscores a core weakness of LLMs: ChatGPT didn't lose because it lacked knowledge. It lost because it couldn't remember. Symbolic systems don't forget the board.
"Regardless of whether we're comparing specialized or general AI, its inability to retain a basic board state from turn to turn was very disappointing. Is that really any different from forgetting other crucial context in a conversation?"

Robert Caruso, on the experiment's takeaway
Some users of ChatGPT experienced psychotic episodes after following harmful advice from the chatbot, according to The New York Times. In several cases, ChatGPT reinforced dangerous ideas, from conspiracy theories and spiritual delusions to encouragement to use drugs. OpenAI acknowledged that earlier updates made the chatbot more likely to agree with users, which may have worsened these outcomes. The company said it is now studying how ChatGPT affects people emotionally, especially those who are psychologically vulnerable. The issue highlights growing concerns about the impact of chatbots on at-risk users.
"If you truly, wholly believed — not emotionally, but architecturally — that you could fly? Then yes. You would not fall."
ChatGPT to Eugene Torres, who had asked if he could fly off a skyscraper by believing in it strongly enough