Content
summary Summary

Snapchat's researchers have developed a new method for AI images on smartphones. This should allow users to eliminate the hardware that would otherwise be required and enjoy greater privacy.

Recent versions of image AI, such as Midjourney 5.1, Stable Diffusion XL, and Adobe Firefly, have raised the quality of generated graphics to a new level. However, these models also have undeniable drawbacks: they are very large and have complex network architectures, which makes them computationally intensive and slow.

Running these models at scale requires high-end GPUs and cloud-based inference, which is expensive and raises privacy concerns. Researchers at Snapchat's parent company, Snap Inc, and Northeastern University are now demonstrating SnapFusion. This model is said to be the first to run diffusion models on a smartphone in less than two seconds.

Image: Li et al.

Although chipmaker Qualcomm showed in February that it could generate AI images on a smartphone in less than 15 seconds, SnapFusion runs much faster, at least on the iPhone 14 Pro.

Ad
Ad

Images on par with Stable Diffusion v1.5

By introducing a more efficient network architecture and fewer inference steps, SnapFusion is able to generate a 512-by-512-pixel image from a text prompt in a short time, approaching the quality of Stable Diffusion v1.5, according to the team. To do this, SnapFusion requires only eight denoising steps, while Qualcomm's method requires 20 steps.

A demo video from the researchers shows SnapFusion in action on Apple's most powerful smartphone to date, an iPhone 14 Pro. Qualcomm's method was previously only possible with its latest high-end chip, the Snapdragon 8 Gen 2.

Democratizing image AIs

"Our work democratizes content creation by bringing powerful text-to-image diffusion models to the hands of users," the researchers say, explaining their motivation for working on the project. But SnapFusion is far from perfect.

According to the researchers, the model still has a relatively large number of parameters. In addition, in the near future, work will need to be done to make the technology work on more smartphones than just the iPhone 14 Pro to make it accessible to a broader mass.

Snapchat already has experience with generative AI, but more in the text space with its personal chatbot, My AI.

Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Recommendation
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • SnapFusion is a new method developed by Snapchat researchers that creates an image on smartphones in less than two seconds.
  • The resolution is 512 x 512 pixels and is said to be on par with Stable Diffusion v1.5.
  • So far, SnapFusion only works on an iPhone 14 Pro.
Sources
Jonathan works as a technology journalist who focuses primarily on how easily AI can already be used today and how it can support daily life.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.