Andrej Karpathy says humans are now the bottleneck in AI research with easy-to-measure results
Karpathy spent months hand-tuning his GPT-2 training setup. Then he let an autonomous agent take over for a single night. The agent discovered fine-grained adjustments Karpathy had overlooked, tweaks that also interact with each other in ways that are easy for a human to miss but straightforward for a systematic search to catch.
Karpathy's takeaway is that researchers should remove themselves from the loop, at least in areas where objective metrics exist. "To get the most out of the tools that have become available now, you have to remove yourself as the bottleneck. You can't be there to prompt the next thing," Karpathy says. Researchers at major AI labs, he argues, place too much unfounded trust in their own intuition and are systematically automating themselves out of a job, which, Karpathy notes, is also their stated goal.
While models keep getting better at coding and other easy-to-verify tasks, Karpathy doesn't think these gains will carry over smoothly to less measurable domains. "Anything that feels softer is, like, worse," he says.
Source: No Priors Podcast
OpenAI publishes a prompting playbook that helps designers get better frontend results from GPT-5.4
In a new guide, OpenAI explains how front-end designers can get better results from GPT-5.4 when building websites and apps, and how to stop the model from falling back on generic designs.
Math needs thinking time, everyday knowledge needs memory, and a new Transformer architecture aims to deliver both
A German research team lets Transformer models decide for themselves how many times they think about a problem. Combined with additional memory, the approach outperforms larger models on math problems.
OpenAI plans to nearly double its workforce by 2026 as it ramps up enterprise push
The AI lab wants to grow from 4,500 to 8,000 employees by the end of 2026, the Financial Times reports, citing two people familiar with the plans. Most new hires will go into product development, engineering, research, and sales. OpenAI is also bringing on "technical ambassadorship" specialists to help companies integrate its tools.
Much of this hiring likely ties back to OpenAI's Frontier, an agent-based AI platform designed to embed deeply into company workflows, the kind of integration that requires hands-on development at the customer's site. OpenAI has already launched the Frontier Alliance with consulting firms like McKinsey, and partnerships with private equity firms are in the works.
The broader context is OpenAI's push to win enterprise customers, particularly in coding, where Anthropic has been steadily gaining ground. While OpenAI focused on ChatGPT features, image generation, video models, and the unpredictable consequences of people actually using the technology, Anthropic quietly carved out a larger share of the enterprise market. OpenAI is now reportedly building a desktop super app that bundles all its key features into one platform.
Source: Financial Times
95% of UK students now use AI and their experiences couldn't be more divided
95 percent of British students use generative AI. But while some say it deepens their learning, others worry it’s replacing their ability to think for themselves. A new survey reveals a student body caught between enthusiasm, overwhelm, and universities that aren’t keeping up.
OpenAI's chief scientist trusts AI with experiments but says it's not at the level to design complex systems
OpenAI Chief Scientist Jakub Pachocki used to write every line of code by hand. Now AI handles experiments that once took him a week, but he’s not ready to let it run the show.
Europe's AI paradox is record adoption that funds foreign ecosystems instead of building its own
Europe leads in AI adoption and matches the US in talent, but owns almost none of the platforms it depends on. A new report by Prosus and Dealroom lays out where the disconnect starts: from missing infrastructure and fragmented regulation to a funding gap that hands Europe’s best startups to American investors. Closing that gap won’t be easy.