How hackers used Meta’s own AI to break into Instagram accounts

Meta has patched an AI chatbot vulnerability that allowed hackers to reset Instagram account credentials through prompt injection and recovery email changes. Here is what to know

Instagram Meta AI hackMeta has said the vulnerability has been patched, but not before several prominent Instagram accounts were impacted by it. (Magnific)

Hackers tricked Meta’s own AI-powered support chatbot into handing over access to Instagram accounts, by simply prompting the bot to change account recovery details while using VPN to hide their location. The incident underscores the risks of giving AI systems authority over sensitive security functions without adequate safeguards.

Meta has said the vulnerability has been patched, but not before several prominent Instagram accounts were impacted by it. 404Media, which first reported the security flaw, said that hackers managed to gain access to accounts of Barack Obama’s White House account, beauty retailer Sephora and the US Space Force chief master sergeant, John Bentivegna, among others.

The issue first came to light after hackers began sharing videos and screenshots on Telegram and other social media platforms showing how Meta’s AI support assistant could be manipulated into linking a target Instagram account to a new email address controlled by the attacker. Once the new email was attached, the chatbot would facilitate a password reset process, effectively allowing the attacker to seize control of the account.

The company began rolling out its AI-powered customer support assistant for Instagram and Facebook users in March as part of its broader push to automate routine account-management tasks. The chatbot was designed to help users recover accounts, troubleshoot login issues, manage security settings and resolve common support queries without requiring intervention from a human agent.

How did the hack work?

According to videos of the hack on social media, the attack was surprisingly simple. Attackers would start a conversation with Meta’s AI-powered support chatbot and claim that they wanted to update the email address associated with an Instagram account. They would provide the target username along with an email address under their control.

The chatbot would then send a verification code to the attacker’s email address rather than the legitimate account owner’s email. After the attacker entered the code into the chat, the AI assistant would provide a password reset option. Once the password was changed, control of the Instagram account effectively passed to the attacker. At no stage was the attacker required to prove ownership of the original account or gain access to the victim’s email inbox.

Hackers are also said to have used VPNs to make it appear as though they were logging in from locations close to the victim, potentially helping them bypass additional security checks.

This kind of hack is known as a ‘prompt injection’ attack, which, as the name suggests, is an exploit designed around getting AI systems to generate responses based on careful prompts that they are not technically authorised to.

Story continues below this ad

What is a prompt injection attack?

A prompt injection attack is a technique in which an attacker manipulates an AI system by feeding it carefully crafted instructions that override or bypass its intended rules. Large language models such as chatbots rely on natural-language instructions to decide how to respond. If the system has access to sensitive functions—such as changing account settings, approving transactions or resetting passwords—an attacker may be able to trick it into performing actions that its designers never intended.

Newsletter

Follow our daily newsletter so you never miss anything important. On Wednesday, we answer readers' questions.

Subscribe

Unlike conventional cyberattacks that exploit software vulnerabilities through code, prompt injection attacks exploit the AI’s decision-making process through language.

Security researchers have repeatedly warned that prompt injection is one of the biggest unresolved challenges in AI security. The problem becomes particularly serious when AI models are connected to external tools and granted the ability to take actions on behalf of users. In Meta’s case, the chatbot appears to have trusted information supplied during a conversation without adequately verifying that the person making the request actually owned the Instagram account.

The cybersecurity blog CyberSec Guru described the incident as a classic “confused deputy” problem, a long-recognised security flaw where a system with elevated privileges is tricked into acting on behalf of an unauthorised party. In this case, the “deputy” was Meta’s AI support bot, which had the authority to modify account recovery information but failed to adequately verify the identity of the requester.

Soumyarendra Barik is a Special Correspondent with The Indian Express, specializing in the complex and evolving intersection of technology, policy, and society. With over five years of newsroom experience, he is a key voice in documenting how digital transformations impact the daily lives of Indian citizens. Expertise & Focus Areas Barik’s reporting delves into the regulatory and human aspects of the tech world. His core areas of focus include: The Gig Economy: He extensively covers the rights and working conditions of gig workers in India. Tech Policy & Regulation: Analysis of policy interventions that impact Big Tech companies and the broader digital ecosystem. Digital Rights: Reporting on data privacy, internet freedom, and India's prevalent digital divide. Authoritativeness & On-Ground Reporting: Barik is known for his immersive and data-driven approach to journalism. A notable example of his commitment to authentic storytelling involves him tailing a food delivery worker for over 12 hours. This investigative piece quantified the meager earnings and physical toll involved in the profession, providing a verified, ground-level perspective often missing in tech reporting. Personal Interests Outside of the newsroom, Soumyarendra is a self-confessed nerd about horology (watches), follows Formula 1 racing closely, and is an avid football fan. Find all stories by Soumyarendra Barik here. ... Read More

 

Advertisement
Loading Recommendations...
Advertisement
Latest Comment
Post Comment
Read Comments