Ad
Skip to content
Read full article about: Google Deepmind upgrades Gemini 3 Deep Think for complex science and engineering tasks

Google Deepmind has upgraded its specialized thinking mode "Gemini 3 Deep Think" and made it available through the Gemini app and as an API via a Vertex AI early access program. The upgrade targets complex tasks in science, research, and engineering.

Google AI Ultra subscribers can access Deep Think through the Gemini app, while developers and researchers can sign up separately for the API program. According to Google Deepmind, the model tops several major benchmarks:

Benchmark Deep Think Claude Opus 4.6 GPT-5.2 Gemini 3 Pro Preview
ARC-AGI-2 (Logical reasoning) 84.6% 68.8% 52.9% 31.1%
Humanity's Last Exam (Academic reasoning) 48.4% 40.0% 34.5% 37.5%
MMMU-Pro (Multimodal reasoning) 81.5% 73.9% 79.5% 81.0%
Codeforces (Coding/algorithms, Elo) 3,455 2,352 - 2,512

While Deep Think dominates in logic and coding, the gap narrows significantly on MMMU-Pro: it scored 81.5 percent, barely ahead of Gemini 3 Pro Preview at 81.0 percent. This suggests the thinking upgrades focus heavily on abstract reasoning rather than visual processing. Deep Think also achieved gold medal-level results at the 2025 Physics and Chemistry Olympiads. Examples of the model in scientific use can be found here.

Pentagon pushes AI companies to deploy unrestricted models on classified military networks

The Pentagon is pressing leading AI companies including OpenAI, Anthropic, Google, and xAI to make their AI tools available on classified military networks – without the usual usage restrictions.

Ad
Read full article about: Anthropic promises to cover consumer electricity costs from new data center construction

The company plans to fully absorb grid upgrade costs, invest in new power generation, and cap its data centers' energy consumption during peak hours. Anthropic CEO Dario Amodei told NBC News that the costs of AI models should fall on Anthropic, not on citizens.

Microsoft and OpenAI made similar commitments back in January. The pledges come amid growing political pressure: New York senators introduced a bill that would pause new data center permits, and Senator Van Hollen is pushing legislation that would require AI companies to cover expansion costs themselves.

According to Politico, the Trump administration is also preparing a voluntary agreement that would commit AI companies to covering electricity price increases. The Lawrence Berkeley National Lab estimates that data centers could consume around 12 percent of all US electricity by 2028 - up from 4.4 percent in 2024.

Read full article about: OpenAI reportedly uses a "special version" of ChatGPT to hunt down internal leakers by scanning Slack and email

OpenAI uses a "special version" of ChatGPT to track down internal information leaks. That's according to a report from The Information, citing a person familiar with the matter. When a news article about internal operations surfaces, OpenAI's security team feeds the text into this custom ChatGPT version, which has access to internal documents as well as employees' Slack and email messages.

The system then suggests possible sources of the leak by identifying files or communication channels that contain the published information and showing who had access to them. It's unclear whether OpenAI has actually caught anyone using this method.

What exactly makes this version special isn't known, but there's a clue: OpenAI engineers recently presented the architecture of an internal AI agent that could serve this purpose. It's designed to let employees run complex data analysis using natural language and has access to institutional knowledge stored in Slack messages, Google Docs, and more.

OpenAI researcher quit over ads because she doesn't trust her former employer to keep its own promises

OpenAI wants to put ads in ChatGPT and former researcher Zoe Hitzig says that’s a dangerous move. She spent two years at the company and doesn’t believe OpenAI can resist the temptation to exploit its users’ most personal conversations.

Ad
Read full article about: OpenAI upgrades Responses API with features built specifically for long-running AI agents

OpenAI is adding new capabilities to its Responses API that are built specifically for long-running AI agents. The update brings three major features: server-side compression that keeps agent sessions going for hours without blowing past context limits, controlled internet access for OpenAI-hosted containers so they can install libraries and run scripts, and "skills": reusable bundles of instructions, scripts, and files that agents can pull in and execute on demand.

Skills work as a middle layer between system prompts and tools. Instead of stuffing long workflows into every prompt, developers can package them as versioned bundles that only kick in when needed. They ship as ZIP files, support versioning, and work in both hosted and local containers through the API. OpenAI recommends building skills like small command-line programs and pinning specific versions in production.

Ad