Generative Power of Tens: Google shows off impressive AI zoom

Max is the managing editor of THE DECODER, bringing his background in philosophy to explore questions of consciousness and whether machines truly think or just pretend to.

Profile

E-Mail

With Generative Power of Tens, researchers demonstrate a method enabling "extreme semantic zooms" from wide-angle views to macro shots. Unlike traditional super-resolution methods, the team from the University of Washington, Google Research, and UC Berkeley uses text prompts for each scale, enabling deeper zoom levels. Compared to traditional outpainting techniques, the approach produces a consistent zoom in which the content of the coarser and finer zoom levels are consistent.

Video: Wang et al.

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

Bank transfer

Sources

GitHub Arxiv

Maximilian Schreiner

Max is the managing editor of THE DECODER, bringing his background in philosophy to explore questions of consciousness and whether machines truly think or just pretend to.

Profile

E-Mail