Researchers hid malicious AI prompts inside tiny images

Experts at Trail of Bits have developed a new type of attack that enables the theft of user data by embedding malicious prompts into images, invisible to the human eye.

The attack relies on high-resolution images onto which human-invisible prompts are embedded. The malicious prompts only manifest when the image’s quality and dimensions are reduced using resampling algorithms.

The new method proposed by Trail of Bits specialists Kikimora Morozova (Kikimora Morozova) and Suha Sabi Hussain (Suha Sabi Hussain) is based on the theory described in a talk at the USENIX 2020 conference by researchers from the Technical University of Braunschweig. They studied the feasibility of attacks leveraging image scaling in the context of machine learning.

When users upload images to AI systems, the systems automatically downscale them, reducing quality to improve performance and reduce costs.

Depending on the specific system, image resampling algorithms can simplify the image using nearest-neighbor, bilinear, or bicubic interpolation. All these methods produce artifacts, which can cause hidden patterns to appear in the downscaled image if the original picture was prepared accordingly.

In Trail of Bits’ example, certain dark regions of the malicious image turn red when bicubic interpolation is used during image processing, allowing the hidden text to appear.

The AI model interprets this text as part of the user’s prompt and automatically combines the text from the image with the legitimate instructions.

Although nothing seems suspicious from the user’s perspective, in practice the AI model executes hidden instructions that can lead to data leaks and other dangerous consequences.

Thus, in the case of the Gemini CLI, the researchers were able to exfiltrate Google Calendar data by sending it to an arbitrary email address, using the Zapier MCP with the parameter trust=True to approve tool calls without user confirmation.

Experts emphasize that such an attack must be tailored for each AI model depending on the image processing algorithm used. However, the researchers confirmed that such attacks work against Google Gemini CLI, Vertex AI Studio (with a Gemini backend), in the Gemini web interface, via the Gemini API through the LLM CLI, in Google Assistant on an Android smartphone, as well as in Genspark.

As part of this research, Trail of Bits developed and released an open-source tool called Anamorpher, which can create malicious images for each of the aforementioned processing methods.

To protect against such attacks, the researchers recommend that AI system developers enforce size limits when uploading images. If reducing the image size is truly necessary, it’s recommended to provide users with a preview of the result. The experts also advise requesting explicit user confirmation for potentially risky operations, especially if text is detected in the image.

“However, the most effective defense is implementing secure design patterns and systematic safeguards that prevent dangerous prompt injections—not just multimodal prompt injections,” the researchers say, citing a paper published in June 2025 on building LLMs resilient to prompt-injection attacks.

05.03.2025 — Polish Space Agency disconnects its network due to hacker attack

27.01.2025 — YouTube plays hour-long ads to users with ad blockers

22.04.2025 — Scammers pose as FBI IC3 specialists, offer 'assistance' to fraud victims

18.02.2025 — Chrome Enhanced Protection mode is now powered by AI

14.02.2025 — 12,000 Kerio Control firewalls remain vulnerable to RCE

16.03.2025 — Researchers force DeepSeek to write malware

29.01.2025 — Google to disable Sync in older Chrome versions

12.03.2025 — Mass exploitation of PHP-CGI vulnerability in attacks targeting Japanese companies

09.02.2025 — Abandoned AWS S3 buckets could be used in attacks targeting supply chains

27.01.2025 — Zyxel firewalls reboot due to flawed update

03.06.2022 — Challenge the Keemaker! How to bypass antiviruses and inject shellcode into KeePass memory

12.01.2022 — First contact. Attacks against contactless cards

09.02.2022 — Kernel exploitation for newbies: from compilation to privilege escalation

08.06.2023 — Cold boot attack. Dumping RAM with a USB flash drive

01.06.2022 — F#ck AMSI! How to bypass Antimalware Scan Interface and infect Windows

22.01.2023 — Top 5 Ways to Use a VPN for Enhanced Online Privacy and Security

01.06.2022 — Log4HELL! Everything you must know about Log4Shell

11.01.2022 — Persistence cheatsheet. How to establish persistence on the target host and detect a compromise of your own system

09.02.2022 — Dangerous developments: An overview of vulnerabilities in coding services

03.06.2022 — Playful Xamarin. Researching and hacking a C# mobile app

05.03.2025 —
Polish Space Agency disconnects its network due to hacker attack

27.01.2025 —
YouTube plays hour-long ads to users with ad blockers

22.04.2025 —
Scammers pose as FBI IC3 specialists, offer 'assistance' to fraud victims

18.02.2025 —
Chrome Enhanced Protection mode is now powered by AI

14.02.2025 —
12,000 Kerio Control firewalls remain vulnerable to RCE

16.03.2025 —
Researchers force DeepSeek to write malware

29.01.2025 —
Google to disable Sync in older Chrome versions

12.03.2025 —
Mass exploitation of PHP-CGI vulnerability in attacks targeting Japanese companies

09.02.2025 —
Abandoned AWS S3 buckets could be used in attacks targeting supply chains

27.01.2025 —
Zyxel firewalls reboot due to flawed update

03.06.2022 —
Challenge the Keemaker! How to bypass antiviruses and inject shellcode into KeePass memory

12.01.2022 —
First contact. Attacks against contactless cards

09.02.2022 —
Kernel exploitation for newbies: from compilation to privilege escalation

08.06.2023 —
Cold boot attack. Dumping RAM with a USB flash drive

01.06.2022 —
F#ck AMSI! How to bypass Antimalware Scan Interface and infect Windows

22.01.2023 —
Top 5 Ways to Use a VPN for Enhanced Online Privacy and Security

01.06.2022 —
Log4HELL! Everything you must know about Log4Shell

11.01.2022 —
Persistence cheatsheet. How to establish persistence on the target host and detect a compromise of your own system

09.02.2022 —
Dangerous developments: An overview of vulnerabilities in coding services

03.06.2022 —
Playful Xamarin. Researching and hacking a C# mobile app