Content
summary Summary

Agent-1 may soon be able to operate any software. Initially, the AI model will launch as part of a browser extension.

Matt Shumer, CEO of HyperWriteAI and OthersideAI, has announced a foundation model called "Agent-1". It is supposed to be able to operate software like a human. A scientific paper is still missing, but he demonstrated the capabilities in a video.

Current AI models such as GPT-4, and even next-generation models that have not yet been released, are unable to reliably operate software and programming interfaces, Shumer said.

Cost and speed are also major issues with the complexity of software operations, he said. "Agent-1" aims to solve this problem. Shumer promises much: "We’re already well above previous state-of-the-art, and we’re improving massively each week."

Ad
Ad

Chrome plugin integration

Agent-1 is expected to be integrated into HyperWrite's Personal Assistant in the next few updates. It is a browser extension that provides a website-independent AI text generator.

Image: Screenshot/THE DECODER

At the end of June, Shumer unveiled a very early version of Personal Assistant, which can perform simple browser tasks such as sending an email or ordering food.

Agent-1, however, could take on much more complex tasks. In Shumer's demo video, you can see Agent-1 controlling a Google Cloud dashboard.

Image: screenshot/HyperWrite

"Dynamic thinking

"Current models store lots of knowledge, leaving fewer parameters for reasoning," he explains. "Instead, we aim to put all of the model's horsepower to work on dynamic reasoning."

This "dynamic reasoning" approach, he says, allows the model to handle situations for which it has not been trained. Shumer set the bar pretty high for Agent-1:

Recommendation

Our goal is to surpass human ability — an assistant that can operate any software and reliably accomplish complex goals on a user’s behalf.

Matt Shumer, CEO of HyperWrite

As with any AI agent, bugs are to be expected, but Shumer promises that Agent 1 will get better over time, learning as it goes. The next few weeks will tell us if Agent 1 lives up to Shumer's promises - other than the short video, there's no other information to back up his claims.

What was ACT-1 again?

Agent-1 is very reminiscent of Adept's demo of ACT-1. The US startup hasn't been heard from much since its announcement in September 2022, aside from a $350 million Series B funding round that closed in March. Its founders include former researchers from Deepmind, Meta, and Google, one of whom was an author of the Transformer architecture that now underpins models like GPT-4.

Recently, Berkeley University and Microsoft Research also introduced Gorilla, a large language model that has been trained on 1,600 programming interfaces and is capable of operating software.

Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • According to HyperWrite CEO Matt Shumer, Agent-1 is a "groundbreaking foundation model" that can already serve software better than other AI models.
  • In the next few updates, it will be integrated into the personal assistant in HyperWrite's browser extension.
  • The big promises and goals are so far only supported by a short video.
Jonathan works as a technology journalist who focuses primarily on how easily AI can already be used today and how it can support daily life.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.