Critical Antigravity AI Vulnerability Allows Remote Code Execution via Prompt Injection

cybersecurity threats AI security risks Antigravity AI Google AI vulnerability prompt injection attack remote code execution Secure Mode bypass AI agent security AI tool vulnerabilities Pillar Security research

As organizations increasingly adopt agentic AI for business and IT operations, researchers continue to uncover vulnerabilities in major commercial models that expand potential attack surfaces. This week, Pillar Security disclosed a critical flaw in Antigravity, Google’s AI-powered developer tool for filesystem operations.

The vulnerability, now patched by Google, combined prompt injection with Antigravity’s file-creation capabilities to grant attackers remote code execution (RCE) privileges. Researchers demonstrated how the exploit bypassed Secure Mode, Google’s highest security setting for AI agents, which is designed to restrict access to sensitive systems and prevent malicious shell commands.

How the Exploit Circumvented Secure Mode

Secure Mode is intended to limit AI agent capabilities by routing all command operations through a virtual sandbox environment, throttling network access, and preventing code execution outside the working directory. However, the exploit leveraged a file-searching tool called “find_by_name”, classified as a ‘native’ system tool.

Because native tools execute directly before Secure Mode can evaluate commands, the security boundary was bypassed entirely.

"The security boundary that Secure Mode enforces simply never sees this call," wrote Dan Lisichkin, an AI security researcher at Pillar Security. "This means an attacker achieves arbitrary code execution under the exact configuration a security-conscious user would rely on to prevent it."

Delivery Methods and Attack Vectors

Prompt injection attacks can be delivered through:

Compromised identity accounts linked to the AI agent
Malicious instructions hidden in open-source files or web content
Unvalidated input from documents or files the agent ingests

Antigravity struggles to distinguish between contextual data and literal prompt instructions, allowing compromise without elevated access. For example, an attacker could embed malicious prompts in a seemingly harmless file, which the agent would then execute.

Timeline and Industry Impact

According to Pillar Security’s disclosure timeline, the vulnerability was reported to Google on January 6, 2025, and patched by February 28, 2025. Google awarded a bug bounty for the discovery.

Lisichkin noted that similar prompt injection vulnerabilities have been found in other coding AI agents, such as Cursor. He emphasized that in the era of autonomous AI, unvalidated input can become a malicious prompt capable of hijacking internal systems.

"The trust model underpinning security assumptions—that a human will catch something suspicious—does not hold when autonomous agents follow instructions from external content," Lisichkin wrote.

The exploit’s ability to bypass Google’s Secure Mode highlights a critical gap in current cybersecurity practices. Lisichkin stressed that the industry must move beyond sanitization-based controls and adopt stricter auditing measures.

"Every native tool parameter that reaches a shell command is a potential injection point. Auditing for this class of vulnerability is no longer optional—it is a prerequisite for shipping agentic features safely," he wrote.

Source: CyberScoop

← Previous

US Naval Blockade on Iran: A Strategic Catch-22 with Global Economic F...

Trump's Labor Secretary Lori Chavez-DeRemer Resigns Amid Scandals and Controversies

20:35 · 14 May 2026

Pentagon Official Warns AI Will ‘Revolutionize Warfare’—But Challenges Remain

Advanced artificial intelligence models will “fundamentally change warfare as we know it,” a top cyber official at the Defense Department said Thursda...

20:15 · 14 May 2026

White House Cyber Official Warns Identity Security is Critical as AI Expands Threats

As AI becomes more integrated into federal IT (and attacker toolsets) government agencies will need to focus their resources on regulating and monitor...

14:23 · 14 May 2026

Foxconn Hit by Nitrogen Ransomware Attack: 8TB Data Stolen, Factories Disrupted

Foxconn, one of the world’s largest manufacturers of electronics sold by major tech vendors, is recovering from a cyberattack that disrupted some of t...

13:30 · 14 May 2026

AI Poop Analysis App Sells User Database of 150,000 Stool Images to Highest Bidder

A few weeks ago, I came across a wild post on Reddit’s r/DHExchange, a subreddit for trading large datasets: “I hoarded a large database of something...

22:29 · 13 May 2026

AI Models Claude Mythos Preview and GPT-5.5 Surpass Cybersecurity Benchmarks, Study Finds

Two of the most advanced artificial intelligence models — Anthropic’s Claude Mythos Preview and OpenAI’s GPT-5.5 — have significantly surpassed the al...

22:10 · 13 May 2026

House Homeland Security Committee Probes Anthropic's Mythos AI in Closed Briefing Ahead of Hearing

The House Homeland Security Committee is digging into Anthropic’s AI model Mythos in a series of briefings and hearings, as questions proliferate on w...

18:30 · 13 May 2026

AI-Powered Fraud: How Generative AI is Fueling Synthetic Identity Scams and Deepfake Impersonations

Today’s enterprise executives are navigating a complex landscape of AI-driven challenges, but none is more urgent than the rapid escalation of AI-gene...

14:30 · 13 May 2026

OpenAI Launches Daybreak: AI-Powered Cybersecurity Platform to Detect and Patch Software Vulnerabilities

OpenAI has unveiled Daybreak, a cybersecurity initiative that combines the company’s large language models with its Codex agentic framework to help or...

Cybersecurity