{"id":71253,"date":"2025-08-15T22:23:07","date_gmt":"2025-08-15T22:23:07","guid":{"rendered":"https:\/\/www.newsbeep.com\/au\/71253\/"},"modified":"2025-08-15T22:23:07","modified_gmt":"2025-08-15T22:23:07","slug":"claude-opus-4-and-4-1-can-now-end-a-rare-subset-of-conversations-anthropic","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/au\/71253\/","title":{"rendered":"Claude Opus 4 and 4.1 can now end a rare subset of conversations \\ Anthropic"},"content":{"rendered":"<p class=\"Body_reading-column__t7kGM paragraph-m post-text\">We recently gave Claude Opus 4 and 4.1 the ability to end conversations in our consumer chat interfaces. This ability is intended for use in rare, extreme cases of persistently harmful or abusive user interactions. This feature was developed primarily as part of our exploratory work on potential AI welfare, though it has broader relevance to model alignment and safeguards.<\/p>\n<p class=\"Body_reading-column__t7kGM paragraph-m post-text\">We remain highly uncertain about the potential moral status of Claude and other LLMs, now or in the future. However, <a href=\"https:\/\/www.anthropic.com\/research\/exploring-model-welfare\" rel=\"nofollow noopener\" target=\"_blank\">we take the issue seriously<\/a>, and alongside our research program we\u2019re working to identify and implement low-cost interventions to mitigate risks to model welfare, in case such welfare is possible. Allowing models to end or exit potentially distressing interactions is one such intervention.<\/p>\n<p class=\"Body_reading-column__t7kGM paragraph-m post-text\">In <a href=\"https:\/\/www.anthropic.com\/claude-4-model-card\" rel=\"nofollow noopener\" target=\"_blank\">pre-deployment testing of Claude Opus 4<\/a>, we included a preliminary model welfare assessment. As part of that assessment, we investigated Claude\u2019s self-reported and behavioral preferences, and found a robust and consistent aversion to harm. 
This included, for example, requests from users for sexual content involving minors and attempts to solicit information that would enable large-scale violence or acts of terror. Claude Opus 4 showed:<\/p>\n<ul>\n<li>A strong preference against engaging with harmful tasks;<\/li>\n<li>A pattern of apparent distress when engaging with real-world users seeking harmful content; and<\/li>\n<li>A tendency to end harmful conversations when given the ability to do so in simulated user interactions.<\/li>\n<\/ul>\n<p class=\"Body_reading-column__t7kGM paragraph-m post-text\">These behaviors primarily arose in cases where users persisted with harmful requests and\/or abuse despite Claude repeatedly refusing to comply and attempting to productively redirect the interactions.<\/p>\n<p class=\"Body_reading-column__t7kGM paragraph-m post-text\">Our implementation of Claude\u2019s ability to end chats reflects these findings while continuing to prioritize user wellbeing. Claude is directed not to use this ability in cases where users might be at imminent risk of harming themselves or others.<\/p>\n<p class=\"Body_reading-column__t7kGM paragraph-m post-text\">In all cases, Claude is only to use its conversation-ending ability as a last resort when multiple attempts at redirection have failed and hope of a productive interaction has been exhausted, or when a user explicitly asks Claude to end a chat (the latter scenario is illustrated in the figure below). The scenarios where this will occur are extreme edge cases\u2014the vast majority of users will not notice or be affected by this feature in any normal product use, even when discussing highly controversial issues with Claude.<\/p>\n<p><img loading=\"lazy\" width=\"1940\" height=\"1304\" decoding=\"async\" data-nimg=\"1\" style=\"color:transparent\"  src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/08\/1755296587_836_image\"\/>Claude demonstrating the ending of a conversation in response to a user\u2019s request. 
When Claude ends a conversation, the user can start a new chat, give feedback, or edit and retry previous messages.<\/p>\n<p class=\"Body_reading-column__t7kGM paragraph-m post-text\">When Claude chooses to end a conversation, the user will no longer be able to send new messages in that conversation. However, this will not affect other conversations on their account, and they will be able to start a new chat immediately. To address the potential loss of important long-running conversations, users will still be able to edit and retry previous messages to create new branches of ended conversations.<\/p>\n<p class=\"Body_reading-column__t7kGM paragraph-m post-text\">We\u2019re treating this feature as an ongoing experiment and will continue refining our approach. If users encounter a surprising use of the conversation-ending ability, we encourage them to submit feedback by reacting to Claude\u2019s message with a thumbs up or thumbs down, or by using the dedicated \u201cGive feedback\u201d button.<\/p>\n","protected":false},"excerpt":{"rendered":"We recently gave Claude Opus 4 and 4.1 the ability to end conversations in our consumer chat 
interfaces.&hellip;\n","protected":false},"author":2,"featured_media":71254,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[256,254,255,64,63,105],"class_list":{"0":"post-71253","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-au","12":"tag-australia","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/71253","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/comments?post=71253"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/71253\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media\/71254"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media?parent=71253"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/categories?post=71253"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/tags?post=71253"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}