Stability AI's new 3D model aims for real-time generation

Jan 10, 2025

Stability AI

Stability AI has unveiled SPAR3D, a new 3D reconstruction model that can create complete 3D objects from a single image in under a second.

At CES, Stability AI partnered with Nvidia to showcase SPAR3D (Stable Point Aware 3D), a system designed to work with Nvidia RTX graphics cards. The model's main selling point is its speed - it can transform a single image into a detailed 3D object almost instantly.

Video: Stability AI

According to Stability AI, SPAR3D offers a high level of control over 3D object creation. The generated point cloud can be edited directly by deleting, duplicating, stretching, coloring or adding new features. SPAR3D is also said to provide accurate geometry and detailed 360-degree views, including typically hidden areas.

Video: Stability AI

The numbers are impressive: it takes just 0.3 seconds to turn a point cloud into a finished mesh, and only 0.7 seconds to create a detailed 3D model from a single image. This speed could make life easier for game developers, product designers, and anyone working on environmental design.

SPAR3D uses a two-part system to create its 3D models. First, a specialized point diffusion model creates a detailed point cloud that captures the object's basic structure. Then, a component called the triplane transformer combines this point cloud with features from the original image, adding high-resolution details for geometry, texture, and lighting.

Flowchart: SPAR3D pipeline for 3D reconstruction with DINOv2 encoder, point cloud processing and triplane transformation for geometry/texture. — The SPAR3D architecture. | Image: Stability AI

Free use under community license

The model is available now under Stability AI's Community License, making it free for both personal and business use. Larger organizations bringing in more than a million dollars annually need to contact Stability AI about an enterprise license.

Anyone interested can download SPAR3D's weights from Hugging Face or access the source code on GitHub. Developers can also tap into the model through Stability AI's Developer Platform API.

The timing of this release makes sense, as Stability AI has been focusing on developing more efficient 3D models for a while now. Their partnership with Nvidia feels like a natural fit, especially since Nvidia just unveiled their own text-to-3D system, Edify 3D, in late 2024. They're not alone in this space - Meta, Midjourney, and Luma AI are all working on similar AI-powered 3D modeling technology.

AI News Without the Hype – Curated by Humans

As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.

AI news without the hype
Curated by humans.

Over 20 percent launch discount.
Read without distractions – no Google ads.
Access to comments and community discussions.
Weekly AI newsletter.
6 times a year: “AI Radar” – deep dives on key AI topics.
Up to 25 % off on KI Pro online events.
Access to our full ten-year archive.
Get the latest AI news from The Decoder.

Subscribe to The Decoder

Stability AI's new 3D model aims for real-time generation

Free use under community license

AI News Without the Hype – Curated by Humans

AI news without the hypeCurated by humans.

AI news without the hype
Curated by humans.