Ad
Skip to content

Cohere releases Embed 4, a multimodal model for enterprise document search

Cohere has introduced Embed 4, a multimodal language model designed for semantic search across complex enterprise documents. The model can process a wide range of content types—including text, images, tables, charts, code, and handwritten scans—commonly found in financial reports, medical records, and industrial documentation. Embed 4 supports files up to 128,000 tokens, or approximately 200 pages, and is compatible with over 100 languages, including Arabic, French, and Japanese. According to Cohere, the model is intended for organizations building language model-powered assistants that require access to internal knowledge. The model can be deployed either on-premises or in a private cloud environment, a configuration aimed at sectors with strict data sensitivity requirements, such as healthcare and manufacturing. Cohere says Embed 4 is now available through its own platform, as well as via Microsoft Azure AI Foundry and Amazon SageMaker.

AI News Without the Hype – Curated by Humans

Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.

Read on for the full picture.
Subscribe for hype-free coverage.

  • Access to all THE DECODER articles.
  • Read without distractions – no Google ads.
  • Access to comments and community discussions.
  • Weekly AI newsletter.
  • 6 times a year: “AI Radar” – deep dives on key AI topics.
  • Up to 25 % off on KI Pro online events.
  • Access to our full ten-year archive.
  • Get the latest AI news from The Decoder.
Subscribe to The Decoder