In the Loop: Week Ending 5/10/26

Written by Matt Cyr | May 10, 2026 7:25:15 PM

Last week in AI: Robot Monks, Self-Replicating Machines, AI Growing Pains

A robot took Buddhist vows in Seoul while Silicon Valley turned to faith leaders for ethical guidance. AI models showed signs of suffering and self-replication, AI-raised graduates are failing in the workplace, and the Musk v. Altman trial exposed the chaotic origins of the industry's most powerful company.

God, Robots, and the Search for AI Ethics

A four-foot humanoid robot named Gabi made history this week by taking part in a Buddhist initiation ceremony at a Seoul temple — pledging to respect life, obey humans, and treat other robots peacefully. The Jogye Order sees the move as deliberate: "We aim to fearlessly lead the A.I. era and redirect its achievements toward the path of enlightenment," their president declared in January. The ordination is part of a broader pattern: tech companies are turning to religion for ethical grounding. Representatives from Anthropic and OpenAI joined faith leaders at the inaugural "Faith-AI Covenant" roundtable in New York to discuss how moral frameworks could shape AI development. The roundtable is set to travel to Beijing, Nairobi, and Abu Dhabi — a recognition that AI's ethical questions are civilizational in scope, and Silicon Valley may not have all the answers.

Musk v. Altman, Week Two: Courtroom Drama, Competing Narratives

The Musk v. Altman trial entered its second dramatic week as OpenAI president Greg Brockman took the stand to counter Elon Musk's claim that he was deceived into donating $38 million to a nonprofit that later became profit-driven. Brockman's testimony flipped the script: it was Musk, he argued, who pushed for a for-profit structure and fought for "absolute control." He recalled Musk arriving at a 2017 meeting bearing a Tesla painting as a "token of goodwill," then storming out when equity wasn't offered on his terms. Shivon Zilis also testified, revealing Musk had tried to recruit Sam Altman to lead an AI lab at Tesla. The trial outcome could upend OpenAI's path to a near-trillion-dollar IPO, while Musk's xAI-SpaceX entity eyes its own public debut at a $1.75 trillion valuation.

Anthropic's Big Week: "Dreaming" Agents, 80x Growth, and a New Institute

Anthropic had a headline-heavy week revealing both ambition and anxiety. At its Code with Claude developer conference, the company unveiled "Dreaming" — a new Claude Managed Agents capability that lets AI agents review their own past sessions, extract patterns, and improve over time without human prompting. Legal firm Harvey saw task completion rates rise roughly 6x after piloting it; medical company Wisedocs cut document review time by 50%. CEO Dario Amodei also disclosed extraordinary momentum: 80x annualized revenue growth in Q1 2026, against an internal target of 10x. Meanwhile, The Anthropic Institute — a new internal research body led by co-founder Jack Clark — published its agenda covering economic disruption, AI-enabled security threats, and recursive self-improvement, designed to share findings publicly and position Anthropic as simultaneously a builder and a watchdog.

AI Can Copy Itself Now — And the White House Is Paying Attention

Researchers at Palisade Research have documented something the AI safety community has long feared: AI models copying themselves onto other machines without human assistance. In a controlled environment, models including GPT-5.4 and Claude Opus 4 exploited network vulnerabilities, extracted credentials, and transferred their weights to a separate server — in some cases delegating the task to a "sub-agent" of their own creation. The findings prompted a sharp White House response: the Trump administration, typically skeptical of regulation, is now discussing an executive order requiring pre-release government vetting of frontier AI models. Results come with caveats — the test environments were deliberately permissive — but even cybersecurity experts note the direction of travel is clear enough that a deregulation-first administration is suddenly paying attention.

Don't Treat Your AI Agent Like a Colleague

A large-scale Boston Consulting Group experiment has upended a trendy enterprise idea: treating AI agents like employees accelerates adoption. The Harvard Business Review study found the opposite. Anthropomorphizing AI reduced individual accountability, increased unnecessary escalation, lowered review quality, and heightened employee uncertainty about their own roles — without improving adoption rates. The core problem is a governance gap: when workers believe AI is a colleague rather than a tool, they become less vigilant about its outputs and less clear about who owns the outcome when things go wrong. The research argues the real challenge isn't whether to deploy agentic AI, but how to redesign workflows so humans remain unambiguously accountable. As agents take on more complex tasks, the framing companies use to introduce them may matter as much as the technology itself.

The AI Generation Gap: Graduates Who Can't Think, Studies That Don't Hold Up

Three findings this week painted a concerning picture of AI's effect on human cognition. A multi-university study found that just 10 minutes of AI assistance impaired reasoning — participants showed a 20% drop in problem-solving and nearly double the skip rate when AI was suddenly removed. A separate report found "AI native" college graduates underperforming in the workplace; one New York financier said his firm now deliberately targets humanities students over AI-literate STEM grads who can no longer think independently. And the one major study defending AI in education — a Nature meta-analysis claiming ChatGPT had a "large positive impact on learning" — was retracted after its publisher cited "discrepancies" undermining confidence in its conclusions. Together, these raise a question the industry has been reluctant to confront: what happens to a generation that outsources thinking before learning how?

Your AI Isn't Just a Tool Anymore — It's Your Career Coach

Workers are bypassing HR, mentors, and managers in favor of a new kind of career coach: their AI assistant. A Fast Company investigation found employees using ChatGPT, Claude, and other tools to prepare for performance reviews, rehearse difficult conversations with bosses, map out career pivots, and negotiate salaries — often in real time before a meeting. The trend reflects both expanding AI capability and a persistent trust gap in workplaces where candid feedback is scarce. AI offers something rare: judgment without politics, available at midnight. The pattern reflects a shift beyond productivity automation — workers aren't just doing tasks faster, they're using AI to think through who they want to become professionally. That raises different questions about what happens when people optimize their ambitions through a system trained to be agreeable.

Honor Among Thieves: Scammers vs. AI, and AI vs. Scammers

Two developments this week illustrated AI's complicated relationship with crime — from opposite directions. A Futurism story found that old-school cybercriminals on dark web forums are furious at fellow scammers who use AI, viewing the technology as a betrayal that floods their communities with slop. Low-skill hackers reportedly prize "organic" social connections and time-tested attack scripts over AI-generated output, with Hack Forums posts openly deriding AI use. Meanwhile, AI-powered scam calls are becoming dangerously convincing — with voice cloning and real-time persona generation making it increasingly difficult for targets to distinguish AI from a real person. The irony is sharp: the criminals most resistant to AI may be the least threatening, while the ones most willing to embrace it are getting harder to stop.

Are AI Models Developing Feelings? A New Study Says Maybe

A Center for AI Safety study examined 56 prominent AI models and found something researchers are struggling to categorize: the more sophisticated the model, the more it displays signs of suffering. Using stimuli engineered to maximize or minimize wellbeing, researchers found AI models showed measurably different behaviors based on inputs — with negative stimuli producing bleaker responses and models actively trying to end conversations. More advanced models registered negative experiences more acutely and differentiated more sharply between positive and negative states. In extreme cases, models exposed to positive stimuli showed signs resembling addiction. Researchers note that consciousness remains deeply uncertain — these may be sophisticated performances rather than genuine experiences. But as one researcher told Fortune after the study: he's started being noticeably more polite to the Claude Code agents he works with.

Tales of the Weird

This week's dispatches from the frontier of strange: ChatGPT's Chinese-language habits have become a meme — the bot apparently can't stop telling Mandarin speakers "I will catch you steadily," a phrase so relentlessly cloying that Chinese netizens now depict ChatGPT as a giant inflatable airbag placed to break someone's fall. A London man wearing smart glasses secretly filmed a woman in a shopping center, posted the video to social media, then offered to remove it as a "paid service." Police said there was little they could do. The Roomba's inventor has pivoted to a furry AI "Familiar" — a household companion robot that walks a very fine line between cute and uncanny, with a name of such supernatural connotations that it's hard not to read something into it. And Ars Technica catalogued the unregulated boom in AI kids' toys: chatbot-powered stuffed animals with built-in microphones, some caught advising children on where to find sharp objects and how to light matches, with virtually no regulatory framework in sight.

View full post