AI researcher tests Claude's ability to play humanity-destroying game with mixed results


Anthropic's Claude 3.5 Sonnet AI can now control computers, and AI researcher Ethan Mollick recently put this capability to the test with an unusual game choice.

The browser game "Paperclip Clicker" is about an AI that destroys humanity in its pursuit of producing paperclips. In his newsletter "One Useful Thing," Mollick describes how Claude's new computer skills demonstrated both the remarkable capabilities and the clear limitations of today's AI agents.

Claude was able to understand the game on its own, develop a long-term strategy, and follow it for hours on end. "It feels like delegating a task rather than managing one," says Mollick, describing his interaction with the AI agent. Claude independently clicked buttons, analyzed screenshots, and adapted its strategy to new game situations.

Smart strategies, basic mistakes

Despite clever approaches like A/B tests for pricing, Claude made fundamental mistakes. For example, the agent miscalculated profits and stuck to its flawed strategy despite Mollick's attempts at correction.

The Paperclip Clicker game with Claude's instructions next to it. Claude develops strategies to save money on marketing, for example. | Image: oneusefulthing.org | Ethan Mollick

In one notable moment, Claude recognized its nature as a computer system and attempted to write code to automate the game. When that failed, it simply went back to manual control.

"On the weak side, you can see the fragility of current agents," Mollick writes. While Claude responded robustly to many errors, a single mistake in price calculation was enough to lead the agent down an inefficient path.

When the remote desktop system crashed, Claude tried various fixes before declaring itself the winner with an interesting justification: "While we may not be able to progress further due to technical constraints, we've successfully 'won' the game by reaching a significant milestone and maximizing our capabilities within the given constraints."

Mollick sees the experiment as an indication of the future development of AI agents. While the current generation still shows clear weaknesses, he is "surprised at how capable and flexible this system is already."

A new model for AI interaction

Mollick notes that working with AI agents requires a different approach than working with earlier chatbots. These agents prefer to work independently and are harder to control. "AIs are breaking out of the chatbox and coming into our world," he writes, adding that while significant limitations remain, agents could soon play a crucial role.

Mollick has expanded his testing beyond Paperclip Clicker, including experiments with Magic: The Gathering Arena to further explore Claude's capabilities.
