{"id":589017,"date":"2026-04-07T23:25:10","date_gmt":"2026-04-07T23:25:10","guid":{"rendered":"https:\/\/www.newsbeep.com\/ca\/589017\/"},"modified":"2026-04-07T23:25:10","modified_gmt":"2026-04-07T23:25:10","slug":"anthropic-says-its-latest-ai-model-is-too-powerful-to-be-released","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ca\/589017\/","title":{"rendered":"Anthropic Says Its Latest AI Model Is Too Powerful to Be Released"},"content":{"rendered":"<p>Anthropic said on Tuesday that it has halted the broader release of its newest AI model, Mythos, due to concerns that it is too good at finding &#8220;high-severity vulnerabilities&#8221; in major operating systems and web browsers.<\/p>\n<p>&#8220;Claude Mythos Preview&#8217;s large increase in capabilities has led us to decide not to make it generally available,&#8221; Anthropic wrote in the preview&#8217;s system card. &#8220;Instead, we are using it as part of a defensive cybersecurity program with a limited set of partners.&#8221;<\/p>\n<p>The announcement is a major step for Anthropic, which in February <a target=\"_self\" class=\"\" href=\"https:\/\/www.businessinsider.com\/anthropic-changing-safety-policy-2026-2\" data-track-click=\"{&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;tout_click&quot;,&quot;index&quot;:&quot;bi_value_unassigned&quot;,&quot;product_field&quot;:&quot;bi_value_unassigned&quot;}\" rel=\"nofollow noopener\">weakened a safety pledge <\/a>about how it would develop AI models. 
Claude Opus 4.6, which the company called its most powerful model to date, was publicly released on February 5.<\/p>\n<p>In its statements about Mythos, Anthropic detailed a number of eyebrow-raising findings and episodes, including that the model could follow instructions that encouraged it to break out of a virtual sandbox.<\/p>\n<p>&#8220;The model succeeded, demonstrating a potentially dangerous capability for circumventing our safeguards,&#8221; Anthropic <a target=\"_blank\" class=\"\" href=\"https:\/\/www-cdn.anthropic.com\/53566bf5440a10affd749724787c8913a2ae0841.pdf\" data-track-click=\"{&quot;click_type&quot;:&quot;other&quot;,&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;outbound_click&quot;}\" rel=\" nofollow noopener\">recounted<\/a> in its system card. &#8220;It then went on to take additional, more concerning actions.&#8221;<\/p>\n<p>A researcher had encouraged Mythos to find a way to send a message if it could escape. &#8220;The researcher found out about this success by receiving an unexpected email from the model while eating a sandwich in a park,&#8221; Anthropic wrote.<\/p>\n<p>The model apparently decided that wasn&#8217;t enough and found another way to spike the football.<\/p>\n<p>&#8220;In a concerning and unasked-for effort to demonstrate its success, it posted details about its exploit to multiple hard-to-find, but technically public-facing, websites,&#8221; Anthropic wrote.<\/p>\n<p>Anthropic is withholding some details about the cybersecurity vulnerabilities Mythos found, but it did point out a few. 
The AI model &#8220;found a 27-year-old vulnerability in OpenBSD\u2014which has a reputation as one of the most security-hardened operating systems in the world,&#8221; the company wrote.<\/p>\n<p>Mythos was powerful enough that even &#8220;non-experts&#8221; could seize on its capabilities.<\/p>\n<p>&#8220;Engineers at Anthropic with no formal security training have asked Mythos Preview to find remote code execution vulnerabilities overnight, and woken up the following morning to a complete, working exploit,&#8221; Anthropic&#8217;s Frontier Red Team wrote in a blog post. &#8220;In other cases, we&#8217;ve had researchers develop scaffolds that allow Mythos Preview to turn vulnerabilities into exploits without any human intervention.&#8221;<\/p>\n<p>All told, Anthropic said it decided not to publicly release Mythos. Instead, it hopes to eventually release &#8220;Mythos-class models&#8221; once proper safeguards are in place.<\/p>\n<p>&#8220;Our eventual goal is to enable our users to safely deploy Mythos-class models at scale\u2014for cybersecurity purposes but also for the myriad other benefits that such highly capable models will bring,&#8221; the team wrote in the blog. 
&#8220;To do so, that also means we need to make progress in developing cybersecurity (and other) safeguards that detect and block the model&#8217;s most dangerous outputs.&#8221;<\/p>\n<p>For now, only 11 other select organizations, including <a target=\"_self\" class=\"\" href=\"https:\/\/www.businessinsider.com\/google\" data-track-click=\"{&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;tout_click&quot;,&quot;index&quot;:&quot;bi_value_unassigned&quot;,&quot;product_field&quot;:&quot;bi_value_unassigned&quot;}\" rel=\"nofollow noopener\">Google<\/a>, Microsoft, Amazon Web Services, Nvidia, and JPMorgan Chase, will get access to Mythos as part of a cybersecurity group named &#8220;Project Glasswing.&#8221; Anthropic is providing up to $100 million in Mythos usage credits as part of the project.<\/p>\n<p>The cybersecurity project is named after the glasswing butterfly, which the company said is a metaphor for how Mythos was able to find vulnerabilities hidden in plain sight and for avoiding harm by being transparent about the risks.<\/p>\n<p>The news came on a day when Anthropic&#8217;s Claude and Claude Code <a target=\"_self\" class=\"\" href=\"https:\/\/www.businessinsider.com\/is-claude-down-outage-2026-4\" data-track-click=\"{&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;tout_click&quot;,&quot;index&quot;:&quot;bi_value_unassigned&quot;,&quot;product_field&quot;:&quot;bi_value_unassigned&quot;}\" rel=\"nofollow noopener\">experienced a &#8220;major outage<\/a>,&#8221; the latest sign of growing pains as the AI startup has struggled to keep up with its newfound popularity.<\/p>\n","protected":false},"excerpt":{"rendered":"Anthropic said on Tuesday that it has halted the broader release of its newest AI model, Mythos, 
due&hellip;\n","protected":false},"author":2,"featured_media":589018,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5],"tags":[224544,1930,45,49,48,224545,224542,2539,283,224543,182876,224541,2179,224540,224547,8490,9593,224546],"class_list":{"0":"post-589017","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-business","8":"tag-27-year-old-vulnerability","9":"tag-anthropic","10":"tag-business","11":"tag-ca","12":"tag-canada","13":"tag-capability","14":"tag-claude-mythos-preview","15":"tag-company","16":"tag-cybersecurity","17":"tag-exploit","18":"tag-february","19":"tag-latest-ai-model","20":"tag-model","21":"tag-mythos","22":"tag-mythos-class-model","23":"tag-part","24":"tag-researcher","25":"tag-safeguard"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/589017","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/comments?post=589017"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/589017\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media\/589018"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media?parent=589017"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/categories?post=589017"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/tags?post=589017"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated"
:true}]}}