
A new AI chip from Meta is designed to speed up the execution of neural networks, while an upgraded supercomputer accelerates the company's own AI research.

The "Meta Training and Inference Accelerator" (MTIA) is a new family of chips designed to make running neural networks, a task known as inference, faster and cheaper. The chip is expected to be in use by 2025. For now, Meta is still relying on Nvidia graphics cards in its data centers.

Like Google's Tensor Processing Units (TPUs), the MTIA is an application-specific integrated circuit (ASIC) optimized for the matrix multiplications and activation functions found in neural networks. According to Meta, the chip can handle low- and medium-complexity AI models better than a GPU.
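To illustrate what such a chip accelerates, here is a minimal sketch (not Meta's actual code, and deliberately unoptimized) of the core inference workload: a fully connected layer is just a matrix multiplication followed by an elementwise activation function, and an ASIC hard-wires exactly these operations.

```python
def matmul(a, b):
    """Multiply an (m x n) matrix by an (n x p) matrix, given as lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def relu(m):
    """Apply the ReLU activation elementwise: negative values become zero."""
    return [[max(0.0, x) for x in row] for row in m]

def dense_layer(x, weights):
    """One fully connected layer: matrix multiply, then nonlinear activation."""
    return relu(matmul(x, weights))

# A single two-feature input passed through one layer (toy numbers):
x = [[1.0, 2.0]]
w = [[1.0, -1.0],
     [0.5, 2.0]]
print(dense_layer(x, w))  # → [[2.0, 3.0]]
```

Real inference chains many such layers with far larger matrices; the point of a dedicated accelerator is to execute these multiply-accumulate patterns with much better performance per watt than general-purpose hardware.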

Video: Meta AI


With Trainium and Inferentia, Amazon also offers access to AI chips for training and execution in the cloud. Microsoft is said to be working with AMD on AI chips.

Research SuperCluster: Meta's RSC AI supercomputer reaches phase two

In January 2022, Meta unveiled the RSC AI supercomputer, which it said at the time would lay the groundwork for the Metaverse. Once fully built out, it is supposed to be the fastest supercomputer specializing in AI workloads. The company has been building this infrastructure since 2020.

According to Meta, the RSC has now reached its second stage with 2,000 Nvidia DGX A100 systems totaling 16,000 Nvidia A100 GPUs. The peak performance, according to Meta, is five exaflops. The RSC will be used for AI research in a variety of areas, including generative AI.

A unique feature of the RSC supercomputer is its ability to use data from Meta's production systems for AI training. Until now, Meta has relied primarily on open-source and publicly available datasets, although the company sits on a huge treasure trove of data.

Video: Meta


RSC has already made its mark: Meta trained the LLaMA language model on it, which, partly leaked and partly published, became the engine of the open-source language model movement. According to Meta, training the largest LLaMA model took 21 days on 2,048 Nvidia A100 GPUs.
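A back-of-the-envelope calculation puts those training figures in perspective. The sketch below simply multiplies out the numbers from the article; the per-GPU-hour cloud price is a purely illustrative assumption, not a figure from Meta.

```python
# Figures stated in the article:
gpus = 2048        # Nvidia A100 GPUs used for the largest LLaMA model
days = 21          # reported training duration

gpu_hours = gpus * days * 24
print(f"{gpu_hours:,} GPU-hours")  # → 1,032,192 GPU-hours

# Illustrative only: assume a hypothetical $2 per A100 GPU-hour cloud rate
# to get a feel for the scale of compute cost.
assumed_rate_usd = 2.0
print(f"~${gpu_hours * assumed_rate_usd:,.0f} at the assumed rate")
```

Roughly a million A100 GPU-hours for a single training run shows why access to infrastructure like the RSC, rather than any single algorithmic trick, is the bottleneck for training frontier-scale models.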

Summary
  • Meta is developing a new family of ASIC chips, called the Meta Training and Inference Accelerator (MTIA), to accelerate and lower the cost of neural network computing. The chip is expected to be in use by 2025.
  • Meta has expanded its Research SuperCluster (RSC) AI supercomputer. The RSC has 2,000 Nvidia DGX A100 systems with 16,000 Nvidia A100 GPUs and achieves a peak performance of five exaflops.
  • The LLaMA language model, which became part of the open-source language model movement, was trained on the RSC. The largest LLaMA model took 21 days to train on 2,048 Nvidia A100 GPUs.
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.