GTC 2026: With Groq 3 LPX, Nvidia adds dedicated inference hardware to its platform for the first time

At GTC 2026, Nvidia expanded the Vera Rubin platform it introduced at CES with custom CPU racks, dedicated inference chips, a new storage architecture, an inference operating system, open model alliances, and agent security software.

Mistral's new Small 4 model punches above its weight with 128 expert modules

Mistral AI has released Mistral Small 4, combining fast text responses, logical reasoning, and image processing in one model. It has 119 billion parameters, but only 6 billion are active per query: its architecture includes 128 expert modules but activates just four at a time. Users can control whether the model responds quickly or thinks more thoroughly. Mistral AI says it's 40 percent faster and handles three times more queries per second than its predecessor.
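
To illustrate what "128 expert modules, four active at a time" means in practice, here is a minimal Python sketch of top-k expert routing. The article does not describe how Mistral Small 4 selects its experts, so the softmax-over-top-k gating shown here is an assumption based on a common mixture-of-experts design, and the `route` function and random router scores are purely hypothetical. Only the numbers 128 and 4 come from the article.

```python
import math
import random

NUM_EXPERTS = 128  # number of expert modules, per the article
TOP_K = 4          # experts activated at a time, per the article


def route(router_logits, k=TOP_K):
    """Pick the k highest-scoring experts and normalize their weights.

    Hypothetical gating scheme (softmax over the selected scores);
    not confirmed as Mistral's actual method.
    """
    # Indices of the k experts with the largest router scores
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i],
                 reverse=True)[:k]
    # Numerically stable softmax restricted to the selected experts
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


# Made-up router scores for a single token, for illustration only
random.seed(0)
logits = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

selected = route(logits)
active_fraction = TOP_K / NUM_EXPERTS  # share of experts used per token

for expert, weight in selected:
    print(f"expert {expert:3d} weight {weight:.3f}")
print(f"active experts per token: {active_fraction:.1%}")
```

Under this scheme only 4 of 128 experts (about 3 percent) run for a given token, which is why the active parameter count can be far below the total. The active share of parameters reported in the article (6 of 119 billion, roughly 5 percent) is higher than the expert share, presumably because some parts of the model are not split into experts.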

Bar chart showing the benchmark results of Mistral Small 4 High compared with Magistral Medium 1.2 and Magistral Small 1.2 in the categories LCR, AIME25, Collie, and LiveCodeBench.
In internal benchmarks, Mistral Small 4 at a high reasoning level scores on par with or better than the specialized Magistral models.

The model ships under the Apache 2.0 license and is available on Hugging Face, the Mistral API, and Nvidia platforms. Mistral AI is also joining the Nvidia Nemotron Coalition, which promotes open AI model development. The company previously released multimodal open-source models in early December with the Mistral 3 series, including the flagship Mistral Large 3 with 675 billion parameters.

OpenAI's biggest problem may not be building AI but getting companies to actually use it beyond ChatGPT

OpenAI is pushing to get its AI into large companies faster through sales, partnerships, and capital. A 10-billion-dollar joint venture and a new deployment arm show where the real challenge lies: getting the technology integrated into actual company workflows.

OpenAI's own wellbeing advisors warned against erotic mode, called it a "sexy suicide coach"

OpenAI’s wellbeing advisory board reportedly voted unanimously against the company’s planned Adult Mode for ChatGPT. Internally, the company is struggling with an error-prone age detection system and unresolved safety issues.

Alibaba consolidates AI efforts under new business unit led by CEO

Alibaba is merging its AI operations into a new business unit called "Alibaba Token Hub" (ATH), led directly by CEO Eddie Wu, Bloomberg reports. The unit brings together the research team behind the Qwen models, the consumer app division, the communication platform DingTalk, and Quark-branded devices like smart glasses.

The goal is to speed up collaboration between research, product development, and sales, and to better monetize AI across the company. The name "Token Hub" is a direct nod to tokens, the billing units used in the AI business.

According to insiders, Alibaba also plans to unveil an AI agent for enterprise customers later this week. The agent runs on Qwen and will gradually be integrated with Taobao and Alipay. The restructuring follows the surprise departure of Qwen research lead Junyang Lin in early March.

According to Bloomberg, Chinese AI providers have a harder time making money from AI than Western competitors like OpenAI, largely because Chinese users are reluctant to pay for software subscriptions.