Anthropic's "Beta Steering API" offers developers a sneak peek at the future of controllable LLMs

Jun 16, 2024

Anthropic is testing a completely new steering option for large language models. The AI startup is offering developers access to its Beta Steering API, which can be used to customize the internal functions of language models. The API is based on recent research on the interpretability of language models. By strengthening individual concepts in models, their output can be strongly influenced.

Interested developers will get access to a subset of Claude's internal features, documentation, sample code, and possibly a Slack channel to communicate with the Anthropic team. In return, testers will be asked to share their projects with Anthropic and provide feedback. The technology is still in the research phase and is not intended for production use. Anthropic emphasizes that the API may be modified or discontinued at any time.

AI News Without the Hype – Curated by Humans

As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.

AI news without the hype
Curated by humans.

Over 20 percent launch discount.
Read without distractions – no Google ads.
Access to comments and community discussions.
Weekly AI newsletter.
6 times a year: “AI Radar” – deep dives on key AI topics.
Up to 25 % off on KI Pro online events.
Access to our full ten-year archive.
Get the latest AI news from The Decoder.

Subscribe to The Decoder

Anthropic's "Beta Steering API" offers developers a sneak peek at the future of controllable LLMs

AI News Without the Hype – Curated by Humans

AI news without the hypeCurated by humans.

AI news without the hype
Curated by humans.