Fooocus is Stable Diffusion for people who don't like prompt engineering
Key Points
- Fooocus is a project that aims to simplify the operation of the generative AI model Stable Diffusion for image generation by allowing users to focus on the image and prompt without having to make manual adjustments.
- The software offers easy installation as well as automated optimizations and quality enhancements that must be manually adjusted in other interfaces. Advanced users can make additional adjustments via the Advanced tab.
- Fooocus requires at least an Nvidia graphics card with 4 gigabytes of VRAM, 8 gigabytes of system RAM, and Windows as the operating system.
Stable Diffusion is a powerful generative AI model for images, but operating it via web and local interfaces often involves a lot of tweaking and prompt engineering. Fooocus aims to change that.
Lvmin Zhang, the person behind Fooocus, describes the project as a rethinking of the designs of Stable Diffusion and Midjourney. From Stable Diffusion, Fooocus takes the model along with the focus on offline use and open source; from Midjourney, it takes the focus on ease of use. Manual adjustment of values such as CFG (the classifier-free guidance scale) is not required, so users can simply concentrate on the image and the prompt.
In short, Fooocus is like a free offline version of Midjourney that uses the latest SDXL model from Stability AI. Midjourney still usually gave better results in my short test, but Fooocus with SDXL comes close.
Fooocus has low requirements and easy installation
Fooocus comes with a simple installation, and the number of mouse clicks between hitting "download" and generating the first image is kept to a minimum, promises Zhang.
Behind the scenes, the project has built in and automated many optimizations and quality improvements that have to be set manually for Stable Diffusion in other interfaces. As with Midjourney, this should give good results on every attempt. If you want to do more, you can use the Advanced tab in Fooocus. Here you can set a sharpness filter or load custom LoRAs, for example. You can also choose a style from a simple selection.
Fooocus requires at least an Nvidia graphics card with 4 gigabytes of VRAM and 8 gigabytes of system RAM under Windows. Microsoft's Virtual Swap needs to be enabled; Windows usually does this automatically, and if not, it can be turned on relatively easily. On a laptop with 16 gigabytes of system RAM and an Nvidia 3060 with 6 gigabytes of VRAM, Zhang reports a speed of less than 1.5 seconds per iteration.
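For readers who want to compare their own machine against these minimums, the short Python sketch below encodes the requirements stated above (Windows, an Nvidia card with 4 gigabytes of graphics memory, 8 gigabytes of system RAM). It is only an illustration: the Machine class and the unmet_requirements function are hypothetical names made up for this example and are not part of Fooocus, and the hardware values have to be entered by hand.

```python
from dataclasses import dataclass

# Minimums as stated in the article. All names here are hypothetical
# and are not taken from the Fooocus code base.
MIN_VRAM_GB = 4
MIN_RAM_GB = 8
REQUIRED_OS = "windows"
REQUIRED_GPU_VENDOR = "nvidia"


@dataclass
class Machine:
    os_name: str
    gpu_vendor: str
    vram_gb: float
    ram_gb: float


def unmet_requirements(machine: Machine) -> list:
    """Return a list of the stated minimums that the machine does not meet."""
    problems = []
    if machine.os_name.lower() != REQUIRED_OS:
        problems.append("operating system is not Windows")
    if machine.gpu_vendor.lower() != REQUIRED_GPU_VENDOR:
        problems.append("graphics card is not from Nvidia")
    if machine.vram_gb < MIN_VRAM_GB:
        problems.append(f"less than {MIN_VRAM_GB} GB of graphics memory")
    if machine.ram_gb < MIN_RAM_GB:
        problems.append(f"less than {MIN_RAM_GB} GB of system RAM")
    return problems


# The laptop configuration mentioned in the article.
laptop = Machine(os_name="Windows", gpu_vendor="Nvidia", vram_gb=6, ram_gb=16)
print(unmet_requirements(laptop))
```

An empty list means the machine meets every minimum named in the article; each entry otherwise names one requirement that is not met.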
More information and the download are available on the Fooocus GitHub.


