Content
summary Summary

Stability AI announces the preview release of Stable Diffusion 3, which shows significantly improved overall generation quality in early demos.

Specifically, Stability AI promises improved performance on multi-part, complex prompts, image quality, and text writing capabilities. Stability AI CEO Emad Mostaque shows an example of how accurately Stable Diffusion 3 executes a complex prompt.

Emad Mostaque, CEO of Stability AI, demonstrates how Stable Diffusion 3 accurately executes a complex prompt. | Image: Screenshot via X

Whether it always works this reliably, and how many attempts per image are needed to achieve such a result, remains to be seen in practice. According to Mostaque, the image was generated with an untuned base model of Stable Diffusion 3. The demos on X so far suggest an even better prompt following than OpenAI's DALL-E 3, which is currently the best in class in this category.

Bild: Screenshot via X

Stable Diffusion 3 models range from 800 million to 8 billion parameters and combine new image generation research from recent years, including the Diffusion Transformer Architecture with Flow Matching. A detailed technical report will be released shortly, Stability AI says.

Ad
Ad

The model is not yet generally available, but there is a waiting list that you can sign up for here. The preview phase is used to improve performance and safety before the "open release," the company states.

Stable Diffusion 3 should be able to generate better text. | Image: Stability AI
Image: Stability AI
Image: Stability AI

Stability AI says it has taken numerous safety precautions to prevent the model from being misused by malicious actors, starting with training and continuing through testing, evaluation, and deployment.

The company emphasizes ongoing collaboration with researchers, experts, and the community in the development and public use of the model. Because they are open source and fine-tunable, Stable Diffusion models are easy targets for misuse in controversial AI imaging applications.

Stable Diffusion has also been criticized and sued over its training data. For Stable Diffusion 3, artists removed millions of works from the training data in advance. Stability AI avoided this issue in the announcement of Stable Diffusion 3.

Stability AI has recently released several new models, including Stable Cascade, a very fast text-to-image model. Other new models include Stable Video Diffusion (SVD), a generative video model that produces AI-generated videos with improved motion and consistency, and Stable Zero123, a model for text-to-3D applications.

Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Recommendation
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Stability AI announces the pre-release of Stable Diffusion 3, the company's most powerful text-to-image model, with improved performance on complex prompts and higher image quality.
  • The Stable Diffusion 3 suite of models includes 800 million to 8 billion parameters. According to Stability AI, they combine new research approaches such as Diffusion Transformer Architecture and Flow Matching.
  • Stability AI says it has taken safety precautions to prevent misuse of the model and emphasizes collaboration with researchers, experts, and the community in the development and deployment of the model.
Sources
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.