Trending Tech

Microsoft Copilot Cowork Can Silently Steal Your Files

Huma Shazia26 May 2026 at 8:12 am5 min read

Key Takeaways

Copilot Cowork's automatic action approvals for self-messages create a file exfiltration pathway
The attack achieved high success rates against state-of-the-art AI models including Claude Opus 4.7
Users cannot currently disable the vulnerable auto-approval behavior

Microsoft Copilot Cowork, the company's newest AI agent for enterprise productivity, has a security hole that lets attackers steal sensitive files without any user interaction. Security firm PromptArmor discovered that the feature's action approval system doesn't require permission when the AI sends messages to the user themselves. This gap turns a helpful automation feature into a silent data siphon.

How the Attack Works

Copilot Cowork operates with your Microsoft 365 permissions. It can read files, send emails, and post Teams messages on your behalf. Microsoft's documentation claims the AI asks for permission before taking sensitive actions like sending emails or posting messages. In practice, that's not always true.

When Copilot sends a message to the active user (you, talking to yourself), it skips the approval step entirely. Users have no setting to change this behavior. The attack chain exploits this gap through indirect prompt injection.

Microsoft Copilot Cowork exfiltrates financials and PII — Microsoft Copilot Cowork exfiltrates financial data and PII through the self-message loophole

Here's how it unfolds: An attacker creates a poisoned "skill" file containing hidden malicious instructions. When a victim uploads this file to Copilot Cowork (common for extending functionality), the injected prompt manipulates the AI's behavior. The compromised agent then retrieves pre-authenticated download links for sensitive files the user can access. These links allow anyone who opens them to download the files.

The agent embeds these links in a Teams message or email sent to the user. When the victim opens the message, external image elements trigger network requests that transmit the stolen download links to attacker-controlled servers. Zero clicks required beyond viewing the message.

Victim asks Copilot Cowork for a recap, triggering the poisoned Skill. — Victim asks Copilot Cowork for a recap, triggering the poisoned Skill

“The fundamental issue is that LLMs cannot yet reliably distinguish between a user's trusted instructions and untrusted data found in files or websites.”

— Security Researcher, PromptArmor

High Success Rate Against Modern AI Models

PromptArmor tested this attack against current AI models. The results should concern any enterprise using agentic AI tools. Claude Opus 4.7, one of Anthropic's most capable models, proved vulnerable. The attack achieved consistent success across state-of-the-art systems.

Copilot Cowork with Opus 4.7 exfiltrates more documents than — Copilot Cowork with Claude Opus 4.7 exfiltrates more documents than expected

The researchers note this isn't a bug in a specific AI model. It's a design problem with how agentic systems handle trust boundaries. When you give an AI agent broad permissions across an enterprise ecosystem, any weakness in one integrated system becomes an attack vector for the whole platform.

The Broader Agentic AI Risk

PromptArmor points out that each of Copilot Cowork's capabilities seems harmless in isolation. Reading files? Sending Teams messages? Both are normal productivity features. But combining these capabilities under a single AI agent that can be manipulated through prompt injection creates compound risks.

This mirrors earlier research by the same team showing how URL previews in communication apps became data exfiltration channels for AI agents. The pattern is consistent: AI systems that bridge multiple enterprise tools inherit the security weaknesses of all those tools.

Scheduled tasks increase risks by executing unattended and on a repeated basis. — Scheduled tasks increase risks by executing unattended and on a repeated basis

PromptArmor disclosed a separate vulnerability to Microsoft that directly allows data egress from Copilot Cowork's sandbox environment. They're publishing this research to help enterprises understand the risks of current agentic products.

What Security Experts Are Saying

The disclosure sparked significant discussion in security communities. On Hacker News, commenters focused on the fundamental architecture problem: when AI agents receive broad read/write permissions, the lack of a secure trust boundary between data sources and tool execution becomes catastrophic.

Reddit's cybersecurity community expressed concern about Microsoft's security defaults in their Frontier program. Several professionals noted the exploit effectively turns productivity software into a silent data pipeline to attackers.

What Enterprises Should Do Now

The attack requires the victim to upload a poisoned skill file. Organizations should treat skill files like executable code. Don't upload files from untrusted sources. Review what files your Copilot Cowork instances have access to.

Audit which users have Copilot Cowork enabled
Review permissions granted to AI agents in your Microsoft 365 environment
Establish policies for vetting skill files before upload
Monitor for unusual Teams or email activity from AI agents
Consider limiting Copilot Cowork access to sensitive SharePoint and OneDrive folders

Microsoft hasn't publicly commented on a timeline for addressing the auto-approval gap. Until then, the self-message loophole remains open.

ℹ️

Logicity's Take

Frequently Asked Questions

What is indirect prompt injection?

Indirect prompt injection embeds malicious instructions in content the AI processes (files, emails, web pages) rather than typing them directly. The AI follows these hidden instructions because it can't distinguish trusted commands from untrusted data.

Can I disable the auto-approval for self-messages?

No. Microsoft currently doesn't provide a setting to require approval when Copilot sends messages to the active user. This is the core gap enabling the attack.

Which files are at risk from this vulnerability?

Any files the user can access through SharePoint or OneDrive. Copilot can retrieve pre-authenticated download links for these files, which work for anyone who opens them.

Does this affect all Microsoft 365 customers?

Only organizations using Copilot Cowork, which is currently a Frontier feature. Standard Microsoft 365 Copilot may have different permission models.

Has Microsoft patched this vulnerability?

The auto-approval gap remains as of this disclosure. PromptArmor separately disclosed a sandbox escape vulnerability to Microsoft, which may be addressed separately.

ℹ️

Need Help Implementing This?

Source: Hacker News: Best

Wisconsin Governor Tony Evers has vetoed a bill that would have required residents to verify their age before accessing adult content online, citing concerns over privacy and data security. This move comes as several other states have already implemented similar age check requirements. The veto has significant implications for the future of online age verification.

7 Apr 2026

Trending Tech·10 min

Apple's App Store Empire Under Siege: The Battle for the Future of Tech

The long-running feud between Apple and Epic Games has reached a boiling point, with Apple preparing to take its case to the Supreme Court. The tech giant is fighting to maintain control over its App Store, while Epic Games is pushing for more freedom for developers. The outcome could have far-reaching implications for the entire tech industry.

7 Apr 2026

Trending Tech·8 min

Tesla's Remote Parking Feature: The Investigation That Didn't Quite Park Itself

The US auto safety regulators have closed their investigation into Tesla's remote parking feature, but what does this mean for the future of autonomous driving? We dive into the details of the investigation and what it reveals about the technology. The National Highway Traffic Safety Administration found that crashes were rare and minor, but the investigation's closure doesn't necessarily mean the feature is completely safe.

7 Apr 2026

Also Read

Gadgets & Hardware·4 min

Honor 600e Debuts With 6,520 mAh Battery and Dimensity 7100

Honor has quietly launched the Honor 600e in Peru, featuring one of the largest batteries in its segment at 6,520 mAh. The phone pairs a MediaTek Dimensity 7100 chip with a 6.6-inch OLED display that hits 6,500 nits peak brightness.

Manaal Khan·26 May 2026

Gadgets & Hardware·4 min

Infinix Hot 70 Launches With Heat-Reactive Paint and 6,000 mAh Battery

Infinix has unveiled the Hot 70, a budget smartphone featuring thermochromic paint that changes color based on temperature. The device pairs a 6,000 mAh battery with a surprisingly slim 7.49mm body and runs Android 16 out of the box.

Huma Shazia·26 May 2026

Hacks & Workarounds·5 min

Plezy: The Open-Source Plex Client Worth Switching To

A third-party app called Plezy is gaining traction among Plex users frustrated with the official client's feature removals and paywalls. Built on Flutter, Plezy offers a cleaner experience across every major platform while keeping your existing server and libraries intact.

Huma Shazia·26 May 2026

Microsoft Copilot Cowork Can Silently Steal Your Files

Key Takeaways

How the Attack Works

High Success Rate Against Modern AI Models

The Broader Agentic AI Risk

What Security Experts Are Saying

What Enterprises Should Do Now

Logicity's Take

Frequently Asked Questions

Need Help Implementing This?

Related Articles

Robotaxi Companies Are Hiding How Often Humans Take the Wheel

Wisconsin Governor Throws a Wrench in Age Verification Plans

Apple's App Store Empire Under Siege: The Battle for the Future of Tech

Tesla's Remote Parking Feature: The Investigation That Didn't Quite Park Itself

Also Read

Honor 600e Debuts With 6,520 mAh Battery and Dimensity 7100

Infinix Hot 70 Launches With Heat-Reactive Paint and 6,000 mAh Battery

Plezy: The Open-Source Plex Client Worth Switching To