With Generative Powers of Ten, researchers demonstrate a method for "extreme semantic zooms" from wide-angle views to macro shots. Unlike traditional super-resolution methods, the team from the University of Washington, Google Research, and UC Berkeley uses a separate text prompt for each scale, which enables deeper zoom levels. Compared to traditional outpainting techniques, the approach produces a zoom sequence in which the content of the coarser and finer zoom levels remains consistent.

Video: Wang et al.

Max is managing editor at THE DECODER. As a trained philosopher, he deals with consciousness, AI, and the question of whether machines can really think or just pretend to.