AI in practice

Aug 14, 2024Aug 14, 2024

Anthropics prompt caching makes your long prompts much cheaper

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

Anthropic's prompt caching feature can cut the cost of long prompts by up to 90% and reduce latency by as much as 85%. The technology lets developers cache frequently used context between API calls, giving Claude more background knowledge and examples to work with. Prompt caching is now in public beta for Claude 3.5 Sonnet and Claude 3 Haiku models, with support for Claude 3 Opus on the way. The feature is a good fit for chat agents, coding assistants, long document processing, detailed instruction sets, agent-based search and tool usage. It also works well for answering questions about books, papers, documentation, and podcast transcripts, Anthropic says. Google also offers prompt caching.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

Bank transfer

Sources

Anthropic

Matthias Bastian

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

AI in practice

Nov 2, 2024Nov 2, 2024

Anthropic's Claude 3.5 Sonnet can now analyze PDFs and images inside them

News, tests and reports about VR, AR and MIXED Reality.

What happens next with MIXED My personal farewell to MIXED Meta and Anduril are now jointly developing XR headsets for the US military MIXED-NEWS.com

AI in practice

Oct 24, 2024Oct 24, 2024

Anthropic's Claude AI can now crunch numbers and visualize data with built-in code tool

AI in practice

Jul 11, 2024Jul 11, 2024

Anthropic launches fine-tuning service and new prompt tuner

Google News

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Anthropics prompt caching makes your long prompts much cheaper

Anthropic's Claude 3.5 Sonnet can now analyze PDFs and images inside them

Anthropic's Claude AI can now crunch numbers and visualize data with built-in code tool

Anthropic launches fine-tuning service and new prompt tuner

OpenAI launches new ChatGPT agent that automates complex tasks for Pro, Plus, and Team

Kimi-K2 is the next open-weight AI milestone from China after Deepseek

New Energy-Based Transformer architecture aims to bring better "System 2 thinking" to AI models

Anthropics prompt caching makes your long prompts much cheaper

Anthropic's Claude 3.5 Sonnet can now analyze PDFs and images inside them

Anthropic's Claude AI can now crunch numbers and visualize data with built-in code tool

Anthropic launches fine-tuning service and new prompt tuner