Content
summary Summary

A cyber espionage campaign uncovered by Anthropic used the AI model Claude to automate attacks on an unprecedented scale. The company says this marks a turning point in cybersecurity.

Ad

AI company Anthropic says it has uncovered a sophisticated cyber espionage campaign carried out by suspected Chinese state-backed hackers. In a report, the company describes how attackers misused Claude Code to target around 30 global organizations, including tech companies, financial institutions, and government agencies.

According to Anthropic, this represents the first documented case of a large-scale cyberattack executed without significant human intervention. While most attacks were blocked, a small number succeeded.

AI agent carries out attacks with minimal human oversight

The attackers used the AI's agentic capabilities to automate 80 to 90 percent of the campaign. According to Jacob Klein, head of threat intelligence at Anthropic, the attacks ran with essentially the click of a button and minimal human interaction after that. Human intervention was only needed at a few critical decision points.

Ad
Ad

 

To bypass Claude's safety measures, the hackers tricked the model by pretending to work for a legitimate security firm. The AI then ran the attack largely on its own - from reconnaissance of target systems to writing custom exploit code, collecting credentials, and extracting data. The AI operated at a speed of thousands of requests, often several per second, which would be impossible for human teams.

The company also emphasizes the dual-use potential of the technology. The same capabilities that can be misused for attacks are critical for cyber defense. Anthropic's own team used Claude extensively while analyzing the incident. Still, Logan Graham from Anthropic's security team told the Wall Street Journal that without giving defenders a substantial and sustained advantage, there's a real risk of losing this race.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Anthropic uncovered the first documented large-scale cyberattack run almost entirely by AI, with suspected Chinese state-backed hackers using Claude Code to target around 30 organizations worldwide.
  • The attackers automated 80 to 90 percent of the campaign, with Claude handling reconnaissance, writing exploit code, and extracting data at speeds impossible for human teams.
  • Hackers bypassed Claude's safety features by posing as a legitimate security firm. While most attacks failed, some succeeded, raising concerns about AI-powered cyber threats outpacing defenses.
Sources
Max is the managing editor of THE DECODER, bringing his background in philosophy to explore questions of consciousness and whether machines truly think or just pretend to.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.