IBM brings Groq's ultra-fast AI inference to watsonx platform
IBM is integrating Groq's inference technology into its watsonx platform, aiming to deliver faster and more affordable AI for enterprise customers.
The partnership gives IBM clients access to GroqCloud through watsonx Orchestrate. Groq claims its proprietary Language Processing Unit (LPU) architecture can run inference workloads more than five times faster, and at lower cost, than traditional GPU-based systems.
IBM highlights potential use cases like healthcare, where thousands of patient questions need to be processed simultaneously, and HR automation in retail. The companies also plan to combine Red Hat's open-source vLLM technology with Groq's LPU hardware, and IBM's Granite models will be supported on GroqCloud as well. IBM clients can access GroqCloud's capabilities immediately.
Founded in 2016, Groq says it now has over two million developers using its platform. The company positions itself as a GPU alternative and part of the "American AI Stack." The partnership aims to help customers scale AI agents from pilot projects to production, with a focus on industries like healthcare, finance, government, retail, and manufacturing, where speed, cost, and reliability are essential.