Ad
Short

Google introduces "implicit caching" in Gemini 2.5, aiming to cut developer costs by as much as 75 percent. The new feature automatically detects and stores recurring content, so repeated prompts are only processed once. According to Google, this can lead to significant savings compared to the old explicit caching method, where users had to set up their own cache. To maximize the benefits, Google recommends putting the stable part of a prompt—like system instructions—at the start, and adding user-specific input, such as questions, afterwards. Implicit caching kicks in for Gemini 2.5 Flash starting at 1,024 tokens, and for Pro versions from 2,048 tokens onwards. More details and best practices are available in the Gemini API documentation.

Short

OpenAI is rolling out a new GitHub connector for ChatGPT's deep research agent. Users with Plus, Pro, or Team subscriptions can now connect their own GitHub repositories and ask questions about their code. ChatGPT searches through the source code and documentation in the repo, then returns a detailed report with source references. Only content that users already have access to is visible to ChatGPT, so existing permissions apply. The connector will become available to users over the next few days, with support for enterprise customers coming soon. According to OpenAI Product Manager Nate Gonzalez, the goal is to better integrate ChatGPT into internal workflows. OpenAI also plans to add more deep research connectors in the future.

Ad
Ad
Short

The Trump administration is moving to overturn the Biden administration's AI chip export restrictions, according to a spokesperson for the US Department of Commerce. Biden's rules, which were set to take effect on May 15, would have split the world into three different zones for AI chip exports, capping shipments to most countries. Trump officials say the current system is "overly complex, overly bureaucratic," and want to replace it with "a much simpler rule that unleashes American innovation and ensures American AI dominance." One idea on the table is a global licensing system based on intergovernmental agreements, but there is no set timeline for new regulations yet. Since taking office, the Trump administration has already rolled back several other Biden-era AI policies.

Short

Netflix is rolling out a new search feature powered by OpenAI's ChatGPT technology, allowing users to look for content using natural language. Instead of typing in titles or keywords, people can now search with phrases like "I want something funny, but not silly." The new search is launching as a beta on iOS devices, with early tests already taking place in Australia and New Zealand, according to Bloomberg. Netflix also announced plans to automatically translate title cards into users' preferred languages. Other updates include a redesigned video homepage and a vertical short-form video feed for mobile devices.

Ad
Ad
Short

Anthropic has launched a web search feature for its Claude API, letting developers combine Claude models with up-to-date web data without building their own search infrastructure. Claude decides when a web search makes sense based on the user's prompt, generates targeted queries, analyzes the results, and delivers answers with source links. The system can also handle multi-step research tasks. Web search is available for Claude 3.7 Sonnet, 3.5 Sonnet, and 3.5 Haiku, and costs $10 per 1,000 searches. Organizations can allow or block specific domains and manage web search access at the organizational level. The feature is also available for Claude Code, which can use it to look up API documentation or technical articles.

Ad
Ad
Short

Microsoft's Phi 4 model generates 56 sentences before responding to "Hi", developer Simon Willison found. This behavior, known as "overthinking", was confirmed by Microsoft's Dimitris Papailiopoulos, who says it's problematic for simple tasks but intentional for complex ones. He plans to address the issue. Microsoft released the open Phi 4 reasoning models in early May.

Phi 4's reasoning process shown in screenshot by Simon Willison.
Google News