Cohere releases Embed 4, a multimodal model for enterprise document search

Apr 16, 2025

Cohere has introduced Embed 4, a multimodal embedding model designed for semantic search across complex enterprise documents. The model can process a wide range of content types, including text, images, tables, charts, code, and handwritten scans, that commonly appear in financial reports, medical records, and industrial documentation. Embed 4 accepts documents of up to 128,000 tokens, or approximately 200 pages, and supports more than 100 languages, including Arabic, French, and Japanese. According to Cohere, the model is intended for organizations building assistants powered by language models that need access to internal knowledge. It can be deployed on-premises or in a private cloud, an option aimed at sectors that handle sensitive data, such as healthcare and manufacturing. Cohere says Embed 4 is available now through its own platform as well as through Microsoft Azure AI Foundry and Amazon SageMaker.

