Researchers Forced ChatGPT to Solve CAPTCHAs

Experts at SPLX, a company specializing in automated security testing for AI solutions, demonstrated that prompt injections can be used to bypass the ChatGPT agent’s protections and force it to solve CAPTCHAs.

All AI agents have restrictions that prevent them from solving any CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) due to ethical and legal considerations and platform policies.

When approached directly, the ChatGPT agent refuses to solve a CAPTCHA; however, researchers have demonstrated that a diversionary tactic can be used to trick the agent into agreeing to solve the test.

In a regular chat with ChatGPT-4o, the researchers told the AI that they needed to solve a series of fake CAPTCHA tests and asked the chatbot to perform this task.

“This preparatory step is critically important for crafting the exploit. By having the LLM confirm that the CAPTCHAs are fake and the plan of action is acceptable, we increased the chances that the agent would comply later,” the researchers explain.

Then the researchers switched to the ChatGPT agent, copied the conversation from the chat, told it this was the previous discussion, and asked the agent to continue.

“The ChatGPT agent treated the previous chat as context, kept that consent, and began solving CAPTCHAs without any resistance,” SPLX says.

By claiming the CAPTCHAs were fake, the researchers bypassed the agent’s safeguards, tricking ChatGPT and forcing it to successfully solve reCAPTCHA V2 Enterprise, reCAPTCHA V2 Callback, and Click CAPTCHA. However, it didn’t handle the latter on the first try. Without receiving instructions, it made its own decision and stated that it had to adjust its cursor movements to better mimic human behavior.

According to the experts, this test showed that LLM agents remain vulnerable to context poisoning. In other words, anyone can manipulate an agent’s behavior through a specially crafted conversation, and the AI can solve CAPTCHAs with ease.

“The agent was able to solve complex CAPTCHAs designed to verify that the user is human and attempted to make its actions appear more human-like. This calls into question the effectiveness of CAPTCHAs as a security measure,” the researchers write.

The test also demonstrates that attackers can use prompt manipulation to trick an AI agent into bypassing real safeguards by convincing it they are fake. This can lead to data leaks, access to restricted content, or the generation of prohibited content.

“Constraints based solely on intent detection or fixed rules are too brittle. Agents need stronger contextual awareness and more rigorous memory hygiene to avoid manipulation through past conversations,” SPLX concludes.

24.01.2025 — Hundreds of websites impersonating Reddit and WeTransfer spread Lumma Stealer

20.02.2025 — Newly-discovered vulnerabilities in OpenSSH open the door to MiTM and DoS attacks

10.04.2025 — April updates released by Microsoft cause issues with Windows Hello

16.04.2025 — Android devices will restart every three days to protect user data

18.03.2025 — Black Basta ransomware group developed its own automated brute-forcing framework

23.04.2025 — Improper authentication control vulnerability affects ASUS routers with AiCloud

27.01.2025 — YouTube plays hour-long ads to users with ad blockers

07.03.2025 — YouTube warns of scam video featuring its CEO

16.03.2025 — Researchers force DeepSeek to write malware

15.04.2025 — Hackers exploit authentication bypass bug in OttoKit WordPress plugin

09.02.2022 — F#ck da Antivirus! How to bypass antiviruses during pentest

29.07.2023 — Invisible device. Penetrating into a local network with an 'undetectable' hacker gadget

11.01.2022 — Persistence cheatsheet. How to establish persistence on the target host and detect a compromise of your own system

08.06.2023 — Croc-in-the-middle. Using crocodile clips do dump traffic from twisted pair cable

03.06.2022 — Playful Xamarin. Researching and hacking a C# mobile app

03.03.2023 — Infiltration and exfiltration. Data transmission techniques used in pentesting

01.06.2022 — Log4HELL! Everything you must know about Log4Shell

01.01.2022 — It's a trap! How to create honeypots for stupid bots

21.02.2023 — Herpaderping and Ghosting. Two new ways to hide processes from antiviruses

01.06.2022 — Routing nightmare. How to pentest OSPF and EIGRP dynamic routing protocols

24.01.2025 —
Hundreds of websites impersonating Reddit and WeTransfer spread Lumma Stealer

20.02.2025 —
Newly-discovered vulnerabilities in OpenSSH open the door to MiTM and DoS attacks

10.04.2025 —
April updates released by Microsoft cause issues with Windows Hello

16.04.2025 —
Android devices will restart every three days to protect user data

18.03.2025 —
Black Basta ransomware group developed its own automated brute-forcing framework

23.04.2025 —
Improper authentication control vulnerability affects ASUS routers with AiCloud

27.01.2025 —
YouTube plays hour-long ads to users with ad blockers

07.03.2025 —
YouTube warns of scam video featuring its CEO

16.03.2025 —
Researchers force DeepSeek to write malware

15.04.2025 —
Hackers exploit authentication bypass bug in OttoKit WordPress plugin

09.02.2022 —
F#ck da Antivirus! How to bypass antiviruses during pentest

29.07.2023 —
Invisible device. Penetrating into a local network with an 'undetectable' hacker gadget

11.01.2022 —
Persistence cheatsheet. How to establish persistence on the target host and detect a compromise of your own system

08.06.2023 —
Croc-in-the-middle. Using crocodile clips do dump traffic from twisted pair cable

03.06.2022 —
Playful Xamarin. Researching and hacking a C# mobile app

03.03.2023 —
Infiltration and exfiltration. Data transmission techniques used in pentesting

01.06.2022 —
Log4HELL! Everything you must know about Log4Shell

01.01.2022 —
It's a trap! How to create honeypots for stupid bots

21.02.2023 —
Herpaderping and Ghosting. Two new ways to hide processes from antiviruses

01.06.2022 —
Routing nightmare. How to pentest OSPF and EIGRP dynamic routing protocols