Stable Diffusion is a powerful open-source image AI that competes with OpenAI's DALL-E 2. Training it was probably much cheaper by comparison.

Anyone interested can download the open-source image AI Stable Diffusion for free from GitHub and run it locally on a compatible graphics card. The card must be reasonably powerful (at least 5.1 GB of VRAM), but you don't need a high-end computer.
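
As a rough illustration of the hardware requirement, the following Python sketch checks whether a graphics card reports at least 5.1 GB of VRAM. It assumes an Nvidia card with the `nvidia-smi` tool installed, and it treats the article's "GB" as GiB; neither detail comes from the Stable Diffusion team, and the helper names are made up for this example.

```python
import subprocess

REQUIRED_VRAM_GB = 5.1  # minimum quoted in the article


def parse_vram_mib(nvidia_smi_output: str) -> list[int]:
    """Parse one memory.total value (in MiB) per line, one line per GPU."""
    return [
        int(line.strip())
        for line in nvidia_smi_output.splitlines()
        if line.strip()
    ]


def has_enough_vram(total_mib: int, required_gb: float = REQUIRED_VRAM_GB) -> bool:
    # Assumption: "GB" in the article is read as GiB (1 GiB = 1024 MiB).
    return total_mib >= required_gb * 1024


def query_vram_mib() -> list[int]:
    """Ask nvidia-smi for the total memory of each GPU, in MiB."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_vram_mib(result.stdout)
```

On a machine with an Nvidia driver, `[has_enough_vram(mib) for mib in query_vram_mib()]` would give one True/False value per installed GPU.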

In addition to the free local version, the Stable Diffusion team also offers access via a web interface. For about $12, you get roughly 1,000 image prompts.

One important difference besides the price: the local version runs without restrictions, while the web version blocks prompts that might generate sexual or violent images, for example. DALL-E 2 and Midjourney also have this restriction.

Stable Diffusion: AI training for relatively little money

Training the image AI was relatively inexpensive, Emad Mostaque reveals on Twitter. The mathematician and computer scientist founded Stability AI, the startup that is the driving force behind Stable Diffusion.

According to Mostaque, the Stable Diffusion team used a cloud cluster with 256 Nvidia A100 GPUs for training. This required about 150,000 A100 hours in total, which Mostaque says equates to a market price of about $600,000.

For DALL-E 2, Mostaque assumes a computational cost of approximately one million A100 hours. The training costs of OpenAI's image AI are thus likely to be far higher than those of Stable Diffusion. However, DALL-E 2 currently delivers better results, in part because of its architecture, which requires more training data.
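
The figures Mostaque quotes imply a price per A100 hour and a rough wall-clock duration. The sketch below works through that arithmetic in Python. It assumes that the 150,000 hours are GPU hours summed over all cards, that all 256 GPUs ran in parallel for the whole run, and that DALL-E 2 would be billed at the same hourly rate. None of these assumptions is confirmed by Mostaque or OpenAI, and the function names are made up for illustration.

```python
# Figures quoted in the article (attributed to Emad Mostaque)
GPU_COUNT = 256
SD_GPU_HOURS = 150_000
SD_COST_USD = 600_000
DALLE2_GPU_HOURS = 1_000_000  # Mostaque's assumption, not an OpenAI figure


def price_per_gpu_hour(cost_usd: float, gpu_hours: float) -> float:
    """Implied market price of one A100 hour."""
    return cost_usd / gpu_hours


def wall_clock_days(gpu_hours: float, gpu_count: int) -> float:
    """Duration in days if all GPUs run in parallel the whole time (assumption)."""
    return gpu_hours / gpu_count / 24


rate = price_per_gpu_hour(SD_COST_USD, SD_GPU_HOURS)
days = wall_clock_days(SD_GPU_HOURS, GPU_COUNT)
# Assumption: DALL-E 2 compute priced at the same hourly rate as Stable Diffusion
dalle2_cost = DALLE2_GPU_HOURS * rate

print(f"Implied price: ${rate:.2f} per A100 hour")
print(f"Stable Diffusion wall-clock time: about {days:.1f} days")
print(f"DALL-E 2 at the same rate: about ${dalle2_cost:,.0f}")
```

Under these assumptions, the implied rate is $4 per A100 hour, the Stable Diffusion run would have taken a little over three weeks, and a million A100 hours would come to roughly $4 million.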

AI training for large models is affordable

Mostaque's explanation is interesting in two respects: First, he mentions concrete training costs. For other large AI models like DALL-E 2 or GPT-3, there are only estimates, some running into the millions, but no confirmed figures. Thanks to Stable Diffusion, there is now a concrete reference point.

Second, the training cost of $600,000 is within a financial range that many companies can afford.

This, in turn, suggests that companies such as OpenAI will not necessarily come to dominate the field of large AI models, at least not because training costs are unaffordable for others. The real cost drivers are likely to be research and development staff as well as data collection and maintenance.

Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Summary
  • Stable Diffusion is an open-source alternative to DALL-E 2 from OpenAI.
  • One of the brains behind Stable Diffusion is Emad Mostaque. He discloses that training the model cost around $600,000.
  • That's a price range where it should be possible for many companies to train their own large generative AI models.
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.