The now-viral X post from Meta AI safety researcher Summer Yue reads, at first, like satire. She told her OpenClaw AI agent to check her overstuffed email inbox and suggest what to delete or archive.
The agent proceeded to run amok. It began deleting all her email in a "speed run" while ignoring the commands from her phone telling it to stop.
"I had to RUN to my Mac mini like I was defusing a bomb," she wrote, posting screenshots of the ignored stop prompts as receipts.
The Mac mini, an inexpensive Apple computer that sits flat on a desk and fits in the palm of your hand, has lately become the device of choice for running OpenClaw. (The Mini is selling "like hotcakes," one "confused" Apple employee reportedly told famed AI researcher Andrej Karpathy when he bought one to run an OpenClaw alternative called NanoClaw.)
OpenClaw is, of course, the open-source AI agent that rose to fame through Moltbook, an AI-only social network. OpenClaw agents were at the heart of that now largely debunked episode on Moltbook in which it looked like the AIs were plotting against humans.
But OpenClaw's mission, according to its GitHub page, is not focused on social networks. It aims to be a personal AI assistant that runs on your own devices.
The Silicon Valley in-crowd has fallen so in love with OpenClaw that "claw" and "claws" have become the buzzwords of choice for agents that run on personal hardware. Other such agents include ZeroClaw, IronClaw, and PicoClaw. Y Combinator's podcast crew even appeared on their most recent episode wearing lobster costumes.
But Yue's post serves as a warning. As others on X noted, if an AI safety researcher can run into this problem, what hope do mere mortals have?
"Were you intentionally testing its guardrails or did you make a rookie mistake?" a software developer asked her on X.
"Rookie mistake tbh," she replied. She had been testing her agent on a smaller "toy" inbox, as she called it, and it had been running well on less important email. It had earned her trust, so she thought she'd let it loose on the real thing.
Yue believes that the sheer amount of data in her real inbox "triggered compaction," she wrote. Compaction happens when the context window (the running record of everything the AI has been told and has done in a session) grows too large, causing the agent to start summarizing, compressing, and managing the conversation.
At that point, the AI may skip over instructions that the human considers quite important.
In this case, it may have skipped her final prompt, where she told it not to act, and reverted to its instructions from the "toy" inbox.
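This failure mode is easy to reproduce in miniature. The Python sketch below is purely illustrative (the token budget, message format, and summarization step are assumptions for the example, not OpenClaw's actual implementation): a naive compaction pass keeps the original task and the most recent messages, and collapses everything in between into a one-line summary. A stop command that is followed by a burst of tool output lands in the summarized middle and effectively vanishes, while the original "delete" task survives.

```python
def tokens(msg: str) -> int:
    # Crude token estimate: whitespace-separated words.
    return len(msg.split())

def compact(history: list[str], budget: int = 30, keep_tail: int = 3) -> list[str]:
    """If the transcript exceeds `budget` tokens, keep the original task
    plus the last `keep_tail` messages, and collapse everything between
    them into a one-line lossy summary."""
    if sum(tokens(m) for m in history) <= budget:
        return history
    head, middle, tail = history[0], history[1:-keep_tail], history[-keep_tail:]
    return [head, f"[summary of {len(middle)} earlier messages]", *tail]

# The stop command arrives, but three tool-output messages follow it,
# pushing it into the region that compaction throws away.
history = [
    "USER: clean up my inbox, suggest deletions",
    "AGENT: deleting batch 1 (200 emails)",
    "USER: STOP deleting my email right now",
    "TOOL: deleted batch 2 (200 emails)",
    "TOOL: deleted batch 3 (200 emails)",
    "TOOL: deleted batch 4 (200 emails)",
]
compacted = compact(history)
# The stop instruction is gone; the original deletion task remains.
```

After compaction, the agent's next action is planned against a context that still contains "clean up my inbox" but no longer contains "STOP", which matches the reverted-to-old-instructions behavior Yue described.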
As several others on X pointed out, prompts can't be trusted to act as safety guardrails. Models may misconstrue or ignore them.
Numerous people offered solutions, ranging from the exact syntax Yue should have used to stop the agent to various methods for ensuring better adherence to guardrails, like writing instructions to dedicated files or using other open-source tools.
In the interest of full transparency, TechCrunch could not independently verify what happened to Yue's inbox. (She did not respond to our request for comment, though she did reply to many questions and comments sent her way on X.)
But it doesn't really matter.
The point of the story is that agents aimed at knowledge workers, at their current stage of development, are risky. People who say they are using them successfully are cobbling together techniques to protect themselves.
One day, perhaps soon (by 2027? 2028?), they may be ready for widespread use. Goodness knows many of us would love help with email, grocery orders, and scheduling dentist appointments. But that day has not yet come.