The Atlantic's new tool lets you check if your work was used to train AI models

Mar 24, 2025

The Atlantic has developed a search tool that lets users check if their work appears in LibGen, a massive archive of pirated books, scientific papers, and articles that was reportedly used to train language models. According to court documents, Meta used the LibGen dataset to train its Llama models. OpenAI told Gizmodo that LibGen content is not included in the current versions of ChatGPT or in OpenAI's API. Other AI companies have not yet commented on whether they used LibGen data in their training. Microsoft recently began offering book licensing deals to publishers.

AI News Without the Hype – Curated by Humans

As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.

AI news without the hype
Curated by humans.

Over 20 percent launch discount.
Read without distractions – no Google ads.
Access to comments and community discussions.
Weekly AI newsletter.
6 times a year: “AI Radar” – deep dives on key AI topics.
Up to 25 % off on KI Pro online events.
Access to our full ten-year archive.
Get the latest AI news from The Decoder.

Subscribe to The Decoder

The Atlantic's new tool lets you check if your work was used to train AI models

AI News Without the Hype – Curated by Humans

AI news without the hypeCurated by humans.

AI news without the hype
Curated by humans.