Content
summary Summary

Stability AI introduces SDXL Turbo, a new text-to-image model capable of generating high-quality AI images in real-time.

Ad

SDXL Turbo builds on the foundation of SDXL 1.0 and implements a new distillation technique for text-to-image models: Adversarial Diffusion Distillation (ADD). This technique reduces the number of image generation steps from 50 to a single step, while maintaining a high image quality.

Like other distillation techniques, ADD uses a previously trained large diffusion image model as a teacher network. You can read the SDXL Turbo research paper detailing the new distillation technique of this model here.

By integrating ADD, SDXL Turbo offers many of the advantages of Generative Adversarial Networks (GANs), such as single-step image output, while avoiding artifacts or blurring often seen in other distillation methods, Stability AI writes.

Ad
Ad

At the same time, it provides higher-quality single-step generation. With just four steps, SDXL Turbo is said to achieve the image quality of SDXL with 50 steps.

SDXL Turbo beats SDXL in just four steps

Stability AI compared several model variants (StyleGAN-T++, OpenMUSE, IF-XL, SDXL, and LCM-XL) by generating images with the same prompt.

Human evaluators were then shown two random outputs and asked to select the output that most closely matched the prompt. Another test was then conducted using the same method for image quality.

In these blind tests, SDXL Turbo outperformed a 4-step configuration of LCM-XL with only one step, and a 50-step configuration of SDXL with only four steps.

Image: Stability AI

The comparison with 50-step SDXL in particular shows that SDXL Turbo can significantly outperform a computationally intensive multi-step model with much lower processing overhead in terms of speed, and even slightly outperform it in terms of image quality.

Recommendation

In addition, SDXL Turbo offers significant improvements in inference speed. On an Nvidia A100, SDXL Turbo generates a 512x512 image in just 207 ms (prompt encoding + a single denoising step + decoding, fp16).

If you want to test a free demo of Stable Diffusion XL Turbo, you can do so on Clipdrop. The demo is not intended for commercial use. If you are interested in commercial use, you can contact Stability AI.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Stability AI introduces SDXL Turbo, a text-to-image model that uses Adversarial Diffusion Distillation (ADD) technology to generate high-quality AI images in real-time.
  • SDXL Turbo reduces the number of image generation steps from 50 to just one. It can achieve the quality of SDXL in as few as four steps.
  • In blind tests, SDXL Turbo outperformed computationally intensive multi-step models not only in speed, but also in image quality. A free demo version is available on Clipdrop, commercial applications should be requested directly from Stability AI.
Sources
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.