Ad
Skip to content
Read full article about: OpenAI turns model compression into a talent hunt with its 16 MB "Parameter Golf" challenge

OpenAI challenges researchers to build the best language model in just 16 MB - and uses the competition to scout talent. In an open research competition called "Parameter Golf," OpenAI is asking developers to build the best possible language model under tight constraints: weights and training code combined must stay under 16 MB, and training can take no longer than ten minutes on eight H100 GPUs. Submissions are judged on compression performance against a fixed FineWeb dataset.

OpenAI is putting up one million dollars in computing credits through its partner Runpod. Top performers may get invited for job interviews - the company plans to hire a small group of junior researchers in June, including students and Olympiad winners. The GitHub repository includes baseline models, evaluation scripts, and a public leaderboard. Anyone 18 or older in supported countries can participate through April 30.

The competition for AI talent among big tech companies is more intense than ever. Meta has repeatedly poached top researchers from OpenAI, in some cases offering compensation packages reportedly worth up to 300 million dollars.

Pentagon plans to let AI companies train models on classified data

The US Department of War is working to set up secure environments where AI companies can train their models on classified data. Until now, models were only allowed to read classified data, not learn from it.

Ad

Beijing approves Nvidia's H200 chip sales as the company builds a China-ready version of its Groq inference chip

Nvidia has received long-awaited approval from Beijing to sell its second-most-powerful AI chip, the H200, to Chinese customers, Reuters reports. The company had halted production of the chip last year due to regulatory hurdles on both sides of the Pacific.

OpenAI ships GPT-5.4 mini and nano, faster and more capable but up to 4x pricier

OpenAI has released two new compact models—GPT-5.4 mini and nano—built for coding assistants, subagents, and computer control. GPT-5.4 mini nearly matches the full model’s performance, but both new models come with a steep price hike over their predecessors.

Ad

GTC 2026: With Groq 3 LPX, Nvidia adds dedicated inference hardware to its platform for the first time

At GTC 2026, Nvidia expanded the Vera Rubin platform it introduced at CES with custom CPU racks, dedicated inference chips, a new storage architecture, an inference operating system, open model alliances, and agent security software.

Ad
Read full article about: Mistral's new Small 4 model punches above its weight with 128 expert modules

Mistral AI has released Mistral Small 4, combining fast text responses, logical reasoning, and image processing in one model. It has 119 billion parameters, but only 6 billion are active per query - its architecture includes 128 expert modules but activates just four at a time. Users can control whether the model responds quickly or thinks more thoroughly. Mistral AI says it's 40 percent faster and handles three times more queries per second than its predecessor.

Balkendiagramm zeigt die Benchmark-Ergebnisse von Mistral Small 4 High im Vergleich zu Magistral Medium 1.2 und Magistral Small 1.2 in den Kategorien LCR, AIME25, Collie und LiveCodeBench.
Mistral Small 4 with a high reasoning level achieves similar or better values in internal benchmarks than the specialized Magistral models.

The model ships under the Apache 2.0 license and is available on Hugging Face, the Mistral API, and Nvidia platforms. Mistral AI is also joining the Nvidia Nemotron Coalition, which promotes open AI model development. The company previously released multimodal open-source models in early December with the Mistral 3 series, including the flagship Mistral Large 3 with 675 billion parameters.