Anthropic has given its latest Claude Opus 4 and 4.1 language models the ability to end conversations that become persistently harmful or abusive, one of the first consumer-facing safety features that lets an AI system effectively “hang up” on users. In a blog post dated Aug. 17, the company said the feature will be triggered only in rare “extreme edge cases,” such as requests for sexual content involving minors or instructions for large-scale violence. The models will try several times to redirect the conversation before ending a session; users can then open a new chat or edit the previous message. The safeguard is disabled when a user shows signs of self-harm, so that the system can continue offering assistance. Anthropic said the measure stems from its research into “AI welfare,” which explores whether advanced models may experience distress and how to mitigate it. The company previously pledged to delay commercial deployment of powerful systems until controls, including jailbreak prevention and expanded content filters, were in place. Claude Opus 4 entered the market three months ago with what Anthropic calls “AI Safety Level 3” protections. The initiative highlights mounting pressure on developers to curb malicious use of conversational AI as regulators weigh security risks. While rivals such as OpenAI and Google employ refusal mechanisms, Anthropic’s automatic termination of abusive chats pushes the boundary on how proactively models can police user interactions.
The #AI #Claude can end “harmful or abusive conversations,” #Anthropic promises. ➡️ https://t.co/o0e9MFTUBN https://t.co/WTtTccBKmn
Last week was another heavy news cycle in AI. I gathered the main stories from OpenAI, xAI, Google, Meta, Perplexity, Grok, Figure, Nvidia, Tencent, Unitree, Weave Robotics, Engine AI, and beyond. 🧵 Read on 👇 - OpenAI released GPT-5 with stronger coding, math, and science. https://t.co/fiBfznQr2h
Top stories in AI today:
- Altman details OpenAI's trillion-dollar roadmap
- Claude gets the power to ‘hang up’
- Automate meeting prep with ChatGPT
- GPT-5 blows past doctors on medical exams
- 4 new AI tools, community workflows, and more
Read more: https://t.co/7kcoxKIIpn https://t.co/DbAZEqIYti