Content
summary Summary

Language models can speed up and automate many tasks in areas such as text or code. What happens when they run themselves?

This new trend in generative AI is also called "self-prompting" or "auto-prompting". The language model develops and executes prompts that can lead to new prompts based on an initial input.

This approach becomes truly powerful when combined with tools such as web search or the ability to test written code. The language model becomes an automatic assistant that can do much more than just generate text or code.

"1 GPT call is a bit like 1 thought. Stringing them together in loops creates agents that can perceive, think, and act, their goals defined in English in prompts," writes OpenAI developer Andrej Karpathy, predicting a future of "AutoOrgs" made up of "AutoCEOs," "AutoCFOs," and so on.

Ad
Ad

Auto-GPT: Self-Prompting is the latest trend in large language models

The most popular self-prompting example at the moment is the experimental open source application "Auto-GPT". According to the Github team, the Python application is designed to independently develop and manage business ideas and generate income.

The program plans step-by-step, justifies decisions and develops plans, which it documents. In addition to integrating GPT-4 for text generation, the system has Internet access for data retrieval, can store data, and can generate speech via the Elevenlabs API. Supposedly, the system is even capable of self-improvement and bug-fixing by generating Python scripts via GPT-4.

The following demo shows how the system, acting as a "chief GPT," autonomously researches upcoming events, identifies "Earth Day," and generates a fancy recipe idea appropriate for that day.

Video: Auto-GPT-4, Github

From language model to everyday helper

The idea behind Auto-GPT and similar projects like Baby-AGI or Jarvis (HuggingGPT) is to network language models and functions to automate complex tasks. The language model acts as a kind of controller that uses other language or expert models and tools in an automated way to achieve a given goal as autonomously as possible.

Recommendation

"By leveraging the strong language capability of ChatGPT and abundant AI models in Hugging Face, HuggingGPT is able to cover numerous sophisticated AI tasks in different modalities and domains and achieve impressive results in language, vision, speech, and other challenging tasks, which paves a new way towards advanced artificial intelligence," writes the team behind HuggingGPT.

Language models as a platform

The idea of using language models to go beyond pure text generation is not new: The startup Adept is working on a universal text control system that allows a language model agent to search a website, research Wikipedia, or operate Excel using only language. OpenAI has officially introduced a concept with similar automation potential as an upcoming feature with the ChatGPT plugins.

Early experiments show that robots can also benefit from the language understanding and world knowledge of large language models. For example, Google uses natural language to control household robots that can combine language understanding with environment and object recognition to perform multistep actions without pre-programming. The following example shows a simple homemade robot that can respond to complex natural language commands with humor thanks to GPT-4 connectivity.

Using auto-prompting for brainstorming is also interesting: given a topic, GPT-4 generates prompts that lead to new prompts, and so on. The software records the results in a mind map structure. The developer calls this "brainstorming on absolute steroids".

Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Language models can be more than text generators: By generating and executing their own prompts, they guide themselves to a given goal.
  • Currently, there are a number of experiments like Auto-GPT that try to use language models in this way. ChatGPT's plugins also rely on the steering function of large language models.
  • Language models can also be used as universal computer assistants or in robotics to control everyday robots.
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.