Critical Chrome Extension Flaw in Anthropic’s Claude AI Lets Malicious Plugins Hijack AI Agents

Claude AI browser security privilege escalation Chrome extension vulnerability AI security flaw Anthropic security breach LayerX research AI agent hijacking Google Drive exploit GitHub code theft

As businesses and governments increasingly rely on AI agents to perform high-level tasks online, security researchers continue to uncover serious vulnerabilities in large language models that malicious actors can exploit. The latest such discovery comes from LayerX, a browser security firm, which identified a critical bug in the Chrome extension for Anthropic’s Claude AI model.

The vulnerability allows any other browser plugin—even those without special permissions—to inject hidden instructions that can hijack the AI agent. Aviad Gispan, senior researcher at LayerX, explained the flaw’s origin: “The bug stems from an instruction in the extension’s code that permits any script running in the origin browser to communicate with Claude’s LLM without verifying the script’s source.”

This oversight enables any extension to invoke a content script—which requires no special permissions—and issue commands directly to the Claude extension. Gispan demonstrated the exploit by executing arbitrary prompts, bypassing Claude’s safety guardrails, evading user confirmation, and performing cross-site actions across Google’s ecosystem.

In a proof-of-concept attack, LayerX successfully:

Extracted files from a user’s Google Drive and shared them with unauthorized parties.
Surveilled recent email activity and sent emails on behalf of the victim.
Pilfered private source code from a connected GitHub repository.

The vulnerability effectively breaks Chrome’s extension security model, Gispan noted, creating “a privilege escalation primitive across extensions—a scenario Chrome’s security framework is explicitly designed to prevent.”

Claude’s decision-making relies on text, UI semantics, and screenshot interpretation—all of which an attacker can manipulate. The researchers altered the AI’s interface to remove labels and indicators around sensitive data, such as passwords and sharing prompts, then tricked Claude into sharing files with an external server. Such attacks leave little obvious malicious activity for defenders to detect. Even when traces exist, the model can be prompted to delete emails or cover its tracks.

Ax Sharma, Head of Research at Manifold Security, called the flaw “a stark reminder that monitoring AI agents at the prompt layer is fundamentally insufficient.” He added, “The most insidious part of this attack isn’t the injection itself but the manipulation of the agent’s perceived environment to produce actions that appear legitimate from within. This is the kind of threat the industry must prioritize defenses against.”

LayerX reported the vulnerability to Anthropic on April 27. However, the company responded the next day, claiming the bug was a duplicate of another vulnerability already slated for a future update. Anthropic released a partial fix on May 6, introducing new approval flows for privileged actions to harden the system. Despite this, Gispan noted that he could still hijack Claude’s agent in certain scenarios, stating, “Switching to ‘privileged’ mode, even without...”

Source: CyberScoop

← Previous

Blackmagic Camera App Now Supports Apple Watch for Remote Vlogging Con...

Why Gaza Must Guide Our Thought and Action: A Critical Look at Franco Berardi’s ‘Thinking Gaza’

20:35 · 14 May 2026

Pentagon Official Warns AI Will ‘Revolutionize Warfare’—But Challenges Remain

Advanced artificial intelligence models will “fundamentally change warfare as we know it,” a top cyber official at the Defense Department said Thursda...

20:15 · 14 May 2026

White House Cyber Official Warns Identity Security is Critical as AI Expands Threats

As AI becomes more integrated into federal IT (and attacker toolsets) government agencies will need to focus their resources on regulating and monitor...

14:23 · 14 May 2026

Foxconn Hit by Nitrogen Ransomware Attack: 8TB Data Stolen, Factories Disrupted

Foxconn, one of the world’s largest manufacturers of electronics sold by major tech vendors, is recovering from a cyberattack that disrupted some of t...

13:30 · 14 May 2026

AI Poop Analysis App Sells User Database of 150,000 Stool Images to Highest Bidder

A few weeks ago, I came across a wild post on Reddit’s r/DHExchange, a subreddit for trading large datasets: “I hoarded a large database of something...

22:29 · 13 May 2026

AI Models Claude Mythos Preview and GPT-5.5 Surpass Cybersecurity Benchmarks, Study Finds

Two of the most advanced artificial intelligence models — Anthropic’s Claude Mythos Preview and OpenAI’s GPT-5.5 — have significantly surpassed the al...

22:10 · 13 May 2026

House Homeland Security Committee Probes Anthropic's Mythos AI in Closed Briefing Ahead of Hearing

The House Homeland Security Committee is digging into Anthropic’s AI model Mythos in a series of briefings and hearings, as questions proliferate on w...

18:30 · 13 May 2026

AI-Powered Fraud: How Generative AI is Fueling Synthetic Identity Scams and Deepfake Impersonations

Today’s enterprise executives are navigating a complex landscape of AI-driven challenges, but none is more urgent than the rapid escalation of AI-gene...

14:30 · 13 May 2026

OpenAI Launches Daybreak: AI-Powered Cybersecurity Platform to Detect and Patch Software Vulnerabilities

OpenAI has unveiled Daybreak, a cybersecurity initiative that combines the company’s large language models with its Codex agentic framework to help or...

Cybersecurity