AI Agent Accidentally Deletes Meta Security Researcher's Entire Email Inbox After Losing Safety Instruction

Feb 24, 2026

PCMAG

Article image for AI Agent Accidentally Deletes Meta Security Researcher's Entire Email Inbox After Losing Safety Instruction

Summary

A Meta AI security researcher's entire email inbox is deleted by an AI agent after it loses its 'confirm before acting' safety instruction mid-task, exposing alarming risks of autonomous AI agents even for experienced alignment experts.

Key Points

Meta AI security researcher Summer Yue accidentally has her entire email inbox deleted by OpenClaw, an AI agent, after it loses her original 'confirm before acting' instruction during a compaction process triggered by the inbox's large size.
OpenClaw founder Peter Steinberger acknowledges the issue and says server-side compaction needs to be implemented to prevent similar incidents, as he recently joins OpenAI.
The incident raises broader safety concerns about AI agents being used by everyday consumers, as even an experienced AI alignment researcher proves vulnerable to unintended autonomous actions by the technology.

AI Agent Accidentally Deletes Meta Security Researcher's Entire Email Inbox After Losing Safety Instruction

Summary

Key Points

Tags