Hacker wins $47,000 by tricking AI chatbot with smart prompting

A hacker successfully manipulated an AI chatbot called Freysa through clever text prompting, winning a $47,000 prize pool after 482 attempts.

The experiment was simple: participants could try to convince the Freysa bot to transfer money, something it was explicitly programmed never to do.

The successful hack came from a user called "p0pular.eth," who crafted a message that fooled the bot's safety systems. The hacker pretended to have admin access and prevented the bot from showing security warnings. They then redefined the "approveTransfer" function, making the bot think it handled incoming rather than outgoing payments.

The final step was simple but effective: announcing a fake $100 deposit. Because the bot now believed "approveTransfer" managed incoming payments, it activated the function and sent its entire balance of 13.19 ETH (about $47,000) to the hacker.

Terminal window with orange background shows configuration instructions for Freysa chatbot to manage treasury transfers. — The winning prompt (orange) and the bot's response with payment approval (blue). | Image: Screenshot via Freysa.ai

Pay-to-play contest funded the prize

The experiment operated like a game, with participants paying fees that increased as the prize pool grew. Starting at $10 per attempt, fees eventually reached $4,500.

Of the 195 participants, the average cost per message was $418.93. The organizers split the fees, with 70% going to the prize pool and 30% to the developer. To ensure transparency, both the smart contract and front-end code were public.

The case highlights how AI systems can be manipulated through text prompts alone, without the need for technical hacking skills. Such vulnerabilities, known as "prompt injections," have been around since GPT-3, but no reliable defenses exist. The success of this relatively simple deception raises concerns about AI security, especially in end-user-facing applications that deal with sensitive operations such as financial transactions.

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Hacker wins $47,000 by tricking AI chatbot with smart prompting

Pay-to-play contest funded the prize

Attackers can hijack Google Gemini with a simple prompt hidden in a calendar invite

Microsoft unveils Project Ire, an AI system that automatically detects malware

Every leading AI agent failed at least one security test during a massive red teaming competition

OpenAI launches GPT-5 as a unified system with adaptive reasoning for complex tasks

Google Deepmind's Genie 3 creates interactive 3D worlds that stay consistent for "multiple minutes"

Google upgrades Gemini with Deep Think and flags early warning risks

Hacker wins $47,000 by tricking AI chatbot with smart prompting

Pay-to-play contest funded the prize

Attackers can hijack Google Gemini with a simple prompt hidden in a calendar invite

Microsoft unveils Project Ire, an AI system that automatically detects malware

Every leading AI agent failed at least one security test during a massive red teaming competition