Content
summary Summary

Anthropic has rolled out Claude Opus 4.1, an updated version of its top-tier hybrid language model.

Ad

The new release brings targeted improvements in programming, showing better performance in code refactoring, data-heavy analysis, and agentic capabilities—the ability to handle complex, multi-step tasks on its own.

Claude Opus 4.1 is now available to paying users on Claude, Claude Code, as well as through the API, Amazon Bedrock, and Google Cloud Vertex AI. Pricing remains the same as for the previous Opus 4 model. Developers can access the updated model using the claude-opus-4-1-20250805 API tag.

New high score in real-world programming benchmark

Claude Opus 4.1 sets a record on the SWE-bench Verified benchmark, scoring 74.5 percent. That's about two points higher than Opus 4 and roughly five points ahead of OpenAI’s o-series. OpenAI’s latest open source model lags further behind. The benchmark measures how well AI models can identify and fix real bugs in open-source code.

Ad
Ad

Beyond programming, Claude Opus 4.1 also sees gains in analysis and research tasks. Anthropic says the model is now better at tracking details and conducting agent-style searches.

Benchmark table: Claude Opus 4.1 vs Opus 4, Sonnet 4, OpenAI o3, Gemini 2.5 Pro in various AI tasks.
Claude Opus 4.1 edges out other leading AI models in areas like agentic coding, visual reasoning, and math competitions. | Image: Anthropic

The now-shuttered coding startup Windsurf reported that Claude Opus 4.1 delivered a one standard deviation improvement on its internal benchmark for junior developers—similar to the jump seen when moving from Sonnet 3.7 to Sonnet 4.

Prepping for GPT-5

The timing of Claude Opus 4.1’s release is notable, as OpenAI is preparing to launch its highly anticipated GPT-5. According to The Information, GPT-5 is expected to raise the bar in programming, math, and agent-based tasks, though it likely won’t bring the same leap seen between GPT-3 and GPT-4.

With only incremental gains expected from GPT-5, Anthropic’s latest update may be enough to keep pace. The company recommends all users move from Opus 4 to Opus 4.1 and says "substantially larger" improvements are on the way in the coming weeks. The move signals Anthropic’s intent to hold its ground as GPT-5 approaches.

For more details, see the system card, model page, pricing page, and docs.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Recommendation
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Anthropic has introduced Claude Opus 4.1, an improved version of its premium AI model, with better results in programming, handling complex data analyses, and managing independent multi-step tasks.
  • In the SWE-bench Verified Benchmark, Claude Opus 4.1 set a record by reaching 74.5 percent, surpassing both its earlier version, Opus 4, and OpenAI's o-series models.
  • The update is available at no additional cost to paying users, and Anthropic advises switching to Claude Opus 4.1 while promising more significant upgrades in the weeks ahead.
Sources
Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.