AI models can barely control their own reasoning, and OpenAI says that's a good sign
With GPT-5.4 Thinking, OpenAI is reporting on “CoT controllability” for the first time – a measure of whether AI models can deliberately steer their own chain-of-thought reasoning. An accompanying study finds that reasoning models almost universally fail at this task, which OpenAI says is encouraging for AI safety.
Moltbook's alleged AI civilization is just a massive void of bloated bot traffic
Over 2.6 million AI agents interact on Moltbook with zero human involvement. They post, comment, and vote, but a new study finds they never learn from each other: hollow interaction without mutual influence, shared memory, or social structures.
Current language model training leaves large parts of the internet on the table
Large language models learn from web data, but which pages actually make it into training sets depends heavily on a seemingly mundane choice: the HTML extractor. Researchers at Apple, Stanford, and the University of Washington found that three common extraction tools pull surprisingly different content from the same web pages.
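The practical effect is easy to demonstrate. Below is a minimal sketch comparing a naive keep-everything extraction with a boilerplate-aware extractor; BeautifulSoup and trafilatura are illustrative stand-ins, not necessarily the tools the study evaluated.

```python
# Minimal sketch: two extraction strategies applied to the same HTML
# yield different "training text". The tools here are stand-ins chosen
# for illustration; the summary above does not name the study's exact
# extractors.
from bs4 import BeautifulSoup   # pip install beautifulsoup4
import trafilatura              # pip install trafilatura

html = """
<html><body>
  <nav>Home | Products | Login</nav>
  <article>
    <h1>Example headline</h1>
    <p>This is the main story text that a pretraining pipeline would
    ideally keep: several sentences of actual article content rather
    than navigation links, cookie banners, or footer boilerplate.</p>
  </article>
  <footer>© Example Corp · Privacy · Terms</footer>
</body></html>
"""

# Strategy 1: keep all visible text, boilerplate included.
naive = BeautifulSoup(html, "html.parser").get_text(" ", strip=True)

# Strategy 2: try to isolate the main content block.
# Note: trafilatura can return None on very short or unusual pages,
# which is itself one way documents silently drop out of a corpus.
main_only = trafilatura.extract(html)

print("naive     :", naive)
print("main_only :", main_only)
```

Which of the two strings ends up in a training corpus is exactly the kind of upstream choice the researchers flag.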
Anthropic can't stop humanizing its AI models: now Claude Opus 3 gets a retirement blog
Anthropic is retiring its Claude Opus 3 AI model and letting it publish weekly essays on Substack. The company says it conducted “retirement interviews” to ask the model about its wishes, and that the model “enthusiastically” agreed. The move is a prime example of how AI companies keep pushing the humanization of their products, blurring the line between philosophical caution and PR stagecraft.