
Researchers have developed an AI system that can turn written stories into manga-style comics automatically. The system, called DiffSensei, can maintain consistent character appearances and control page layouts throughout a story.


The project comes from a collaboration between Peking University, the Shanghai AI Laboratory, and Nanyang Technological University. DiffSensei combines diffusion models with large language models to handle both the visual and narrative elements of manga creation.

To showcase the system's capabilities, the team created a fictional manga about AI pioneers Geoffrey Hinton, Yann LeCun, and Yoshua Bengio. The story follows their quest to develop an AI model that could outperform the Transformer architecture, capturing their struggles, self-doubt, and eventual triumph, and culminating in their Nobel Prize win years later.

Image: Wu et al.

DiffSensei generates personalized manga

The system uses multimodal models and LoRAs to keep characters looking consistent from panel to panel. It creates manga in three steps: generating page layouts, drawing the characters, and adding dialogue.
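
The three-step flow can be pictured as a small data pipeline. The sketch below is purely illustrative and is not DiffSensei's code: the names `Panel`, `plan_layout`, `place_characters`, and `add_dialogue` are hypothetical, the dialogue lines are made up, and each stage is a stand-in that only shuffles bounding boxes where the real system would call its models.

```python
from dataclasses import dataclass, field


@dataclass
class Panel:
    box: tuple                                       # (x, y, width, height) in page pixels
    characters: dict = field(default_factory=dict)   # name -> (x, y, width, height)
    dialogue: list = field(default_factory=list)     # (speaker, text, box) entries


def plan_layout(page_w, page_h, rows):
    """Step 1 (stand-in): split the page into equal-height, full-width panels."""
    h = page_h // rows
    return [Panel(box=(0, i * h, page_w, h)) for i in range(rows)]


def place_characters(panels, cast):
    """Step 2 (stand-in): give each listed character a slot inside its panel."""
    for panel, names in zip(panels, cast):
        x, y, w, h = panel.box
        slot = w // max(len(names), 1)
        for i, name in enumerate(names):
            panel.characters[name] = (x + i * slot, y, slot, h)
    return panels


def add_dialogue(panels, lines):
    """Step 3 (stand-in): attach each line to the top quarter of its speaker's slot."""
    for idx, speaker, text in lines:
        cx, cy, cw, ch = panels[idx].characters[speaker]
        panels[idx].dialogue.append((speaker, text, (cx, cy, cw, ch // 4)))
    return panels


# Hypothetical three-panel page; names borrowed from the demo manga above.
page = plan_layout(800, 1200, rows=3)
page = place_characters(page, [["Hinton"], ["LeCun", "Bengio"], ["Hinton"]])
page = add_dialogue(page, [(0, "Hinton", "We need a new architecture."),
                           (1, "Bengio", "Then let's build it.")])
```

The point of the sketch is the ordering: layout boxes exist before characters are placed, and dialogue is positioned relative to characters that are already on the page.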

Image: Wu et al.

To train DiffSensei, the researchers built a custom dataset called MangaZero, containing more than 43,000 manga pages and 427,000 individual panels from 48 different series. Each panel is annotated with character positions and dialogue placement, details the team says are essential for the system to work properly.
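
To make the idea of a panel annotation concrete, here is a minimal sketch of what one record might hold. The field names, the box convention, and the example values are all assumptions made for illustration; the article does not describe MangaZero's actual schema.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Assumed convention: (x_min, y_min, x_max, y_max) in page pixels.
Box = Tuple[int, int, int, int]


@dataclass
class PanelAnnotation:
    series: str                  # hypothetical series identifier
    page: int                    # page number within the series
    panel_box: Box               # where the panel sits on the page
    character_boxes: List[Box]   # one box per character in the panel
    dialogue_boxes: List[Box]    # one box per speech bubble


def inside(inner: Box, outer: Box) -> bool:
    """True if `inner` lies entirely within `outer`."""
    return (outer[0] <= inner[0] and outer[1] <= inner[1]
            and inner[2] <= outer[2] and inner[3] <= outer[3])


def is_valid(a: PanelAnnotation) -> bool:
    """Sanity check: every character and dialogue box must fall inside the panel."""
    return all(inside(b, a.panel_box)
               for b in a.character_boxes + a.dialogue_boxes)


# Made-up example record, not taken from the dataset.
example = PanelAnnotation(
    series="example-series",
    page=1,
    panel_box=(0, 0, 800, 400),
    character_boxes=[(100, 50, 400, 390)],
    dialogue_boxes=[(450, 20, 700, 120)],
)
```

A check like `is_valid(example)` is the kind of consistency test such annotations allow: boxes that spill outside their panel would signal a labeling error.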

Researchers see potential in manga production

The system isn't perfect yet. It struggles when character reference images are unclear, and sometimes similar-looking characters end up blending together in unexpected ways. Without specific character references, the artwork tends to look generic rather than matching a particular manga style.

Despite these limitations, the researchers believe DiffSensei could help streamline manga production in the future. The technology gives artists, publishers, and creators a new tool for making personalized manga stories while maintaining control over characters and layouts.

The research team has made more examples and their dataset available on the DiffSensei project page.

Summary
  • Researchers from Peking University, the Shanghai AI Laboratory, and Nanyang Technological University have developed DiffSensei, an AI system that can automatically turn written stories into manga-style comics while maintaining consistent character appearances and controlling page layouts.
  • DiffSensei combines diffusion models with large language models to handle both the visual and narrative elements of manga creation. It generates manga in three steps: creating page layouts, drawing the characters, and adding dialogue. It was trained on a custom dataset called MangaZero, which contains more than 43,000 annotated manga pages.
  • Although DiffSensei struggles with unclear character reference images and produces generic-looking artwork when no specific character references are given, the researchers believe it could help streamline manga production by providing artists, publishers, and creators with a new tool for making personalized manga stories while maintaining control over characters and layouts.
Max is managing editor at THE DECODER. As a trained philosopher, he deals with consciousness, AI, and the question of whether machines can really think or just pretend to.