Meta AI trained on Facebook and Instagram content

Meta used content from Facebook and Instagram to train its new AI assistant, Meta AI. The company might use your input data to improve Meta AI.

Meta AI is based on a custom Llama 2 model combined with Meta's new Emu image model. During Meta's Connect 2023 conference, Nick Clegg, Meta's president of global affairs, told Reuters that the new AI assistant was trained with content from Facebook and Instagram, in addition to publicly available datasets. The text content went into Llama, and the images went into Emu.

According to Clegg, only publicly available posts were used for training; private posts shared only with family and friends, as well as private messages, were excluded. Meta also avoided public datasets with "a heavy preponderance of personal information," he said.

Clegg said that much of the data Meta uses for training is publicly available, but that some sources are off limits: data from LinkedIn, for example, is not used for training. If you enter data into Meta AI, Meta may use it to improve the assistant's capabilities, a spokesperson told Reuters.

AI Copyright: Clegg expects tough trials

Meta's chief lobbyist expects a "fair amount of litigation" in the debate over whether the use of copyrighted data falls under the fair use doctrine.

The fair use doctrine in U.S. copyright law permits limited use of copyrighted material without the rights holder's permission, for example for research or for sufficiently transformative new works. AI companies like OpenAI, which has been sued several times, are expected to invoke fair use in upcoming court cases. Clegg expects the courts to agree.

For Meta AI, Meta has built in safeguards to avoid abuse. These include preventing the generation of realistic photos of famous people and content that violates copyright laws.

In its latest image model, DALL-E 3, OpenAI also prevents the generation of images based on the style-defining names of well-known living artists. In addition, the company is offering artists the option to remove their images from the training data of future models.

In addition to its ChatGPT competitor Meta AI, Meta demonstrated a range of generative AI applications for its social platforms at Connect 2023. These include personalized AI chats based on celebrities, AI-powered image editing, and text-based sticker generation.
