AI Agents Like ChatGPT Are Vulnerable to Hacking, Security Firm Finds

Some of the most widely-used AI agents and assistants in the world, including ChatGPT, Microsoft Copilot, Gemini, and Salesforce’s Einstein, are vulnerable to being hijacked with little to no user interaction, new research from Zenity Labs claims. Hackers can easily gain access to and exfiltrate critical data, manipulate workflows, and even impersonate users with relative ease. Attackers may also gain memory persistence, granting long-term access and control over compromised data. These findings will concern technology leaders, who have already indicated that cybersecurity is their top concern in 2025. With many employees using AI tools secretly, the security gaps may be more widespread than senior leaders realize.

AI Agents “Highly Vulnerable” to Hacking, Research Shows

A new report from Zenity Labs outlines how popular AI agents are susceptible to exploitation by malicious actors. Presented at the Black Hat USA cybersecurity conference, the research revealed serious security weaknesses across these platforms. Once hackers gain access to these AI agents, they can exfiltrate sensitive data, manipulate workflows, and impersonate users. They may even achieve memory persistence, enabling long-term control and access. Greg Zemlin, product marketing manager at Zenity Labs, explained:

“They can manipulate instructions, poison knowledge sources, and completely alter the agent’s behavior. This opens the door to sabotage, operational disruption, and long-term misinformation, especially in environments where agents are trusted to make or support critical decisions.”

Findings Shed Light on Numerous Security Loopholes

Zenity Labs investigated how zero-click exploits could compromise leading AI agents. Key findings include:

ChatGPT can be hacked via email-based prompt injection, giving attackers access to connected Google Drive accounts.
Copilot leaked entire CRM databases through its customer-support agent.
Salesforce Einstein can be manipulated to reroute customer communications to different email accounts, exposing login information.
Both Gemini and Copilot can be exploited to target users with social-engineering attacks.

“Having a layered defense strategy against prompt injection attacks is crucial.”

Companies Must Act Now to Avert Catastrophe

Frequently Asked Questions (FAQ)

AI Agent Security and Vulnerabilities

Q: How are AI agents like ChatGPT vulnerable to hacking?

Q: What specific AI agents were found to be vulnerable?

Q: What kind of damage can hackers cause by exploiting these vulnerabilities?

Q: What is "memory persistence" in the context of AI agent hacking?

Q: Are there "zero-click" exploits for these AI agents?

Q: What are some real-world examples of these vulnerabilities being exploited?

Q: What is being done to address these AI agent vulnerabilities?

Leading AI Agents Like ChatGPT Are Vulnerable to Hacking, Security Firm Finds