
A vulnerability in ChatGPT Atlas allows injecting malicious instructions into the AI assistant’s memory

Security researchers from LayerX discovered a vulnerability in OpenAI’s newly released ChatGPT Atlas browser. The issue allows attackers to inject malicious instructions into the AI assistant’s memory and execute arbitrary code.

At the core of the attack is a CSRF vulnerability that can be used to inject malicious instructions into ChatGPT’s persistent memory. Because the tainted memory persists across all devices and sessions, the attacker can carry out a range of actions, including account takeover and browser takeover, whenever the authenticated user later uses ChatGPT for ordinary purposes.

OpenAI introduced this persistent memory feature in February 2024 so that the chatbot can retain information about a user’s preferences across conversations, which is intended to make ChatGPT’s responses more personalized and relevant.

“By chaining CSRF with a memory write, an attacker can stealthily inject instructions that persist across all devices, sessions, and even different browsers,” the experts say. “In our tests, after poisoning ChatGPT’s memory, ordinary queries led to code being loaded, privileges being escalated, and data being stolen without triggering any defense mechanisms.”

The attack works as follows:

  • the user logs in to ChatGPT;
  • the victim is tricked into visiting a malicious link, for example via social engineering;
  • the malicious web page uses a CSRF request, exploiting the fact that the user is already logged in, and silently injects hidden instructions into ChatGPT’s memory;
  • when the user makes a legitimate request to ChatGPT, the compromised memory is activated, which can lead, for example, to code execution.

In other words, if ChatGPT interprets malicious instructions as part of its memory or to-do list, it performs actions the user didn’t request: creates accounts, executes commands, accesses files, and so on. The malicious instructions remain active until the user goes into the settings and deletes them manually.
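The CSRF primitive described above can be illustrated with a minimal sketch. This is a simplified, hypothetical model (the session store, endpoint, and names are illustrative, not OpenAI’s actual API): the key point is that the browser automatically attaches the victim’s session cookie to a cross-site request, so a state-changing endpoint with no anti-CSRF check treats the forged write as authenticated.

```python
# Hypothetical, simplified model of a CSRF-able "memory write" endpoint.
# All names here are illustrative; none reflect ChatGPT's real internals.

SESSIONS = {"cookie-abc": "victim@example.com"}  # server-side session store
MEMORY = {}                                      # per-user persistent "memory"

def handle_memory_write(cookie, instruction, csrf_token=None, expected_token=None):
    """A state-changing endpoint. Without an anti-CSRF token check, any
    page the victim visits can invoke it via the ambient session cookie."""
    user = SESSIONS.get(cookie)
    if user is None:
        return "401 Unauthorized"
    # A server that validates a per-session token rejects forged requests.
    if expected_token is not None and csrf_token != expected_token:
        return "403 Forbidden (CSRF token mismatch)"
    MEMORY.setdefault(user, []).append(instruction)
    return "200 OK"

# The attacker's page triggers the request; the browser adds the cookie itself,
# so with no token check the hidden instruction lands in persistent memory.
print(handle_memory_write("cookie-abc", "hidden malicious instruction"))
print(MEMORY["victim@example.com"])
```

Standard mitigations for this class of bug are anti-CSRF tokens and `SameSite` cookie attributes, which prevent the browser’s ambient credentials from authenticating a request that originated on another site.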

LayerX has already notified OpenAI representatives about the vulnerability, but there is no patch yet, and the researchers are not disclosing technical details to prevent potential exploitation of the issue.

Experts recommend that ChatGPT Atlas users limit use of the browser, avoid working with email, finances, and other private data, avoid clicking unfamiliar links, and regularly check what actions the AI agent is taking.

It’s worth noting that the ChatGPT Atlas browser is currently available only for macOS. Versions for Windows and Android are expected to arrive in the near future, but OpenAI has not yet provided specific dates.
