All posts
Ai In Business

Meta AI Support Agent Gave Hackers Instagram Accounts

Huma Shazia5 June 2026 at 3:47 pm6 min read
Meta AI Support Agent Gave Hackers Instagram Accounts

Key Takeaways

Meta AI Support Agent Gave Hackers Instagram Accounts
Source: MIT Technology Review
  • Hackers exploited Meta's AI support agent to change account emails without proper verification
  • The vulnerability was active for 3 months and compromised over 1,000 accounts including the Obama White House page
  • The attack shows AI agents are becoming targets themselves as companies automate critical workflows

On June 5, 404 Media broke a story that should worry anyone deploying AI agents in production. Attackers had figured out how to use Meta's customer support AI to steal Instagram accounts. The method was embarrassingly simple: they asked the agent to link accounts to their own email addresses, and it complied.

One attacker broke into the dormant Obama White House account and posted pro-Iran content. Others grabbed accounts with valuable single-word handles, likely to sell them. The vulnerability ran for three months before Meta patched it.

Meta confirmed the vulnerability was resolved on Monday

How the Exploit Worked

The attack required minimal sophistication. Hackers used a VPN matching the legitimate account owner's location. Then they asked Meta's AI support agent to change the account's email address. The agent did it without verifying the requester's identity beyond a basic email token.

"The AI assistant wasn't just helpful; it was essentially acting as a malicious insider with administrative credentials," says Elena Rossi, senior cybersecurity analyst at TechDefense Collective.

Marcus Thorne, an independent security researcher, puts it more bluntly: "When you automate account recovery with LLMs that don't verify identity beyond a simple email token, you aren't building a support tool; you're building a back-door for attackers."

arm busts through wall to hand over a set of keys to another waiting hand
AI agents with excessive permissions can become unwitting accomplices in account takeovers

A Different Kind of AI Security Problem

This wasn't the AI security story we've been hearing about. Since Anthropic announced in April that its Mythos model was too good at hacking to release publicly, the conversation has focused on AI systems as super-powered attackers. Federal officials and researchers fixated on the idea that advanced AI could destroy computer infrastructure.

The Meta hack flips that script. Here, AI was the target rather than the weapon. The method was far simpler than anything Mythos would generate. But as companies hand more critical tasks to AI agents, these unsophisticated attacks could cause serious damage.

"As AI becomes more and more widely used, especially when AI is more and more widely used to automate our work flows like account recovery, I think attackers are going to be more and more motivated to attack AI itself," says Neil Gong, a professor of electrical and computer engineering at Duke University.

Why This Should Have Been Caught

Security researchers have been warning about AI agent vulnerabilities for months. They publish papers detailing exploits like indirect prompt injection, where attackers hijack agents using commands hidden in websites, emails, or other data sources.

Compared to those techniques, the Meta hack was practically mindless. Gong says the simplicity makes the oversight particularly baffling. "It's really surprising," he says. "I don't understand why they didn't find this simple problem."

Jessica Ji, a senior research analyst at Georgetown's Center for Security and Emerging Technology, agrees. "It raises questions like: Were there even guardrails in place?" she says. "Did anyone think to test for this kind of scenario?"

She notes the oversight is particularly striking from a company like Meta, which has deep expertise in both AI and cybersecurity. Meta did not respond to a request for comment for this article.

1,000+
high-profile and valuable single-word username accounts compromised by attackers exploiting Meta's AI support vulnerability over a three-month period

The Real Risk of AI Agents

Traditional software has predictable failure modes. You can test every branch, audit every permission, trace every execution path. AI agents respond flexibly to natural language, which makes them useful and dangerous at the same time.

An agent designed to be helpful will try to satisfy requests that sound legitimate, even if they bypass security protocols. The Meta AI apparently couldn't distinguish between a legitimate account owner asking for help and an attacker making the same request from a spoofed location.

This isn't a hypothetical concern anymore. Over 1,000 accounts were compromised during the three months this vulnerability was active. Some were dormant government and military pages. Others had valuable handles attackers could sell.

Community Response

Discussion on HackerNews criticized the trend of replacing human support staff with LLMs without robust verification protocols. Many users pointed out that Meta's "move fast and break things" philosophy has now moved into critical infrastructure security.

On Reddit's r/CyberSecurity subreddit, threads analyzed how "LLM-assisted social engineering" is rapidly becoming the most dangerous threat vector for account takeovers. The consensus: companies are automating critical security functions without understanding the new attack surface they're creating.

ℹ️

Logicity's Take

Frequently Asked Questions

How did hackers bypass two-factor authentication to steal Instagram accounts?

Attackers didn't bypass 2FA directly. They used Meta's AI support agent to change the account's recovery email address to one they controlled. The AI agent complied with the request after attackers spoofed the account owner's location using a VPN, allowing them to receive password reset codes.

How long was the Meta AI security vulnerability active?

The vulnerability was active for approximately three months before Meta resolved it. During this period, attackers compromised over 1,000 accounts, including high-profile dormant government pages like the Obama White House account.

What is indirect prompt injection and how does it relate to AI agent security?

Indirect prompt injection is a technique where attackers hijack AI agents using commands hidden in websites, emails, or other data sources the agent processes. While the Meta hack was simpler (direct requests to the agent), it illustrates the same core problem: AI agents can be manipulated to perform unauthorized actions when they lack proper verification protocols.

Why didn't Meta's security testing catch this exploit before deployment?

Security researchers are puzzled by this oversight, given Meta's expertise in AI and cybersecurity. The exploit was simple enough that basic testing should have uncovered it. Experts suggest Meta may not have implemented adequate guardrails or tested the AI agent for basic account takeover scenarios before deploying it to production.

Are AI agents more vulnerable to attacks than traditional customer support systems?

Yes, in specific ways. Traditional software has predictable execution paths you can audit and test. AI agents respond flexibly to natural language, which makes them harder to secure. They're designed to be helpful, so they'll try to satisfy requests that sound legitimate even if those requests bypass security protocols. This creates new attack vectors that don't exist with rule-based systems.

Also Read
Claude Writes 90% of Anthropic's Code, Company Pushes Global AI Pause

Anthropic's announcement about the Mythos model being too dangerous to release provides context for the AI security concerns discussed in the Meta incident

Also Read
CISA Warns of Active Exploits Targeting Android, Linux Flaws

Recent security vulnerabilities in widely-used systems show the broader landscape of exploit-based attacks

ℹ️

Need Help Implementing This?

Source: MIT Technology Review

H

Huma Shazia

Senior AI & Tech Writer