Content
summary Summary

Yann LeCun, Meta's lead AI researcher, believes that large language models (LLMs) will not lead to human-like intelligence. Instead, he is pursuing an alternative "world model" approach.

Ad

LeCun argues that LLMs like those behind ChatGPT have a "very limited understanding of logic" and "do not understand the physical world, do not have persistent memory, cannot reason in any reasonable definition of the term and cannot plan […] hierarchically."

According to LeCun, LLMs can only respond correctly when given the right training data, making them "intrinsically unsafe." He is not counting on the evolution of LLMs to achieve human-like intelligence.

"If you are a student interested in building the next generation of AI systems, don't work on LLMs," LeCun writes. While LLMs are useful "despite their limitations," LeCun says, big companies are already devoting enough effort.

Ad
Ad

Of course, LeCun is promoting his own research here, which focuses on developing a "common sense" world model.

Instead of laboriously learning individual tasks, such as generating text or images, AI should understand the world and then use this basic knowledge to learn to solve tasks more easily and, above all, much more efficiently, similar to how humans learn.

To achieve this, LeCun says, AI models must understand the physical world, have persistent memory, reason, and plan, possibly hierarchically. "Four essential characteristics necessary for intelligent behavior, which humans and many animals exhibit," LeCun writes.

LeCun first presented his concept of autonomous AI in the spring of 2022 and has been developing it ever since. He estimates that it could take up to ten years for this vision to become a reality.

Meta's Fundamental AI Research (FAIR) Lab is already focused on this new generation of AI, which is necessary to develop useful AI agents for everyday use, LeCun says.

Recommendation

"[Achieving AGI is] not a product design problem, it’s not even a technology development problem, it’s very much a scientific problem," LeCun tells the Financial Times.

Nevertheless, Meta invests heavily in LLM research and development, sometimes outpacing its commercial competitors with its open-source Llama model. The company also implements Llama in its own products, such as Meta AI.

A particularly capable version has been announced for the latest model, Llama 3, which might outperform OpenAI's GPT-4. It is not yet clear whether the largest version of Llama 3 will also be open source.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Yann LeCun, Meta's leading AI researcher, does not believe that Large Language Models (LLMs) like ChatGPT are the way to human-like intelligence. They would have limitations in logic, understanding the physical world, memory, rational reasoning, and hierarchical planning.
  • Instead, LeCun is focusing on developing a "common sense" AI world model that understands the world and then learns to solve tasks efficiently - similar to humans. It could take up to a decade of research to realize this vision, LeCun estimates.
  • Despite the criticism of LLMs, Meta is also investing heavily in their development, releasing a capable open-source model in the form of Llama and implementing it in its own products. A version of Llama 3 has been announced that will outperform OpenAI's GPT-4.
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.